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TITLE : Multiple Marker Assay for Detection of Ovarian Cancer 
FIELD OF THE INVENTION 

The invention relates to compositions, kits, and methods for detecting, characterizing, preventing, 
and treating ovarian cancer. 
5 BACKGROUND OF THE INVENTION 

Epithelial ovarian carcinoma is the most common and most lethal of all gynecologic malignancies. 
Only 30% of ovarian tumors are diagnosed at an early stage (Stage I/II), when survival rates reach 90%. The 
rest are diagnosed at an advanced stage, with survival rates of less than 20% ( Greenlee RT, Hill-Harmon 
MB, Murray T, et al., 2001. CA Cancer J Clin .2001;51:15-36). Currently, the only well-accepted 
10 serological marker is CA125, a large glycoprotein of unknown function (Meyer T, Rustin GJ., Br J Cancer 
.2000;82:1535-1538). However, CA125 has limitations as a diagnostic, prognostic and screening tool 
(Holschneider CH, Berek JS, Semin Surg Oncol .2000;19:3-10). Consequently, there is a need to enhance 
the overall diagnostic/prognostic capability of CA125. 

Kallikreins are a subgroup of secreted serine proteases, encoded by highly conserved and tightly 

15 clustered multigene families in humans, rats and mice. The human kallikrein gene family resides on 
chromosome 19ql3.4 and is comprised of 15 members, whose genes are designated as KLK1 to KLK15 and 
the corresponding proteins as hKl to hK15 ( Yousef GM, Diamandis EP., Endocr Rev .2001;22:184-2041; 
Yousef GM, Chang A, Scorilas A, et al., Biochem Biophys Res Commun. 2000;276:125-133; Diamandis EP, 
Yousef GM, Clements J, et al. Clin Chem .2000;46:1855-1858). Kallikreins are expressed in a wide variety 

20 of tissues and are found in many biological fluids (e.g. cerebrospinal fluid, serum, seminal plasma, milk, 
etc.) where they are predicted to process specific substrates. Kallikreins may participate in cascade reactions 
similar to those involved in digestion, fibrinolysis, coagulation, wound healing and apoptosis (( Yousef GM, 
Diamandis EP., Endocr Rev .2001;22:184-2041). Many kallikreins have been found to be differentially 
expressed in endocrine-related malignancies (Diamandis EP, Yousef GM, Expert Rev. Mol. Diagn 

25 .2001;1:182-190), including prostate ( Barry MJ. Clinical practice, N Engl J Med .2001;344:1373-1377; 
Rittenhouse HG, Finlay JA, Mikolajczyk SD, et al., Crit Rev Clin Lab Sci .1998;35:275-368; and Yousef 
GM, Scorilas A, Jung K, et al., J Biol Chem .2001;276:53-61), ovarian ( Kim H, Scorilas A, Katsaros D, et 
al., Br J Cancer, 2001;84:643-650; Anisowicz A, Sotiropoulou G, Stenman, et al., Mol Med . 1996; 2:624- 
636; Tanimoto H, Underwood LJ, Shigemasa K, et al.,. Cancer . 1999;86:2074-2082; Magklara A, Scorilas 

30 A, Katsaros D, et al., Clin Cancer Res .2001;7:806-811; Yousef GM, Kyriakopoulou LG, Scorilas A, et al., 
Cancer Res .2001;61:7811-7818; Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta 
.2001;306:111-118), breast ( Yousef GM, Magklara A, Chang A, et al., Cancer Res .2001;61:3425- 
3431;Yousef GM, Chang A, Diamandis EP; J Biol Chem .2000;275:11891-11898; and Yousef GM, 
Magklara A, Diamandis EP, Genomics .2000;69:331-341), and testicular cancer ( Luo LY, Rajpert-De Meyts 

35 ER, Jung K, et al.,2001; 85:220-224). In addition, many kallikrein genes examined thus far are under steroid 
hormone regulation, implicating a role for kallikreins in endrocrine-related tissues (Yousef GM, Diamandis 
EP., Endocr Rev., 2001;22:184-204). Furthermore, hK6, hK10 and hKll have been recently identified as 
novel serological ovarian cancer biomarkers ( Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta 
.2001;306:111-118 Diamandis EP, Yousef GM, Soosaipillai AR, Bunting P., Clin Biochem. 2000;33:579- 
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583, and Diamandis EP, Okui A, Mitsui S, et al., Cancer Res .2002;62:295-300). 
SUMMARY OF THE INVENTION 

The present invention seeks to overcome the drawbacks inherent in the prior art and seeks to 
provide sensitive and accurate multimarker methods for the detection of ovarian cancer. A plurality of 
5 kallikrein polypeptides and polynucleotides encoding the polypeptides, optionally in combination with 
CA125 and polynucleotides encoding CA125 can have particular application in the detection of ovarian 
cancer. A plurality of kallikrein markers (i.e. two or more of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 
8, kallikrein 10, and kallikrein 11) and polynucleotides encoding the polypeptides, optionally in combination 
with CA125 and polynucleotides encoding CA125, constitute biomarkers for the diagnosis, monitoring, 
10 progression, treatment, and prognosis of ovarian cancer, and they may be used as biomarkers before surgery 
or after relapse. 

In accordance with the methods of the invention, the presence of levels of markers in a sample can 
be assessed, for example by detecting the presence in the sample of (a) polypeptides or polypeptide 
fragments corresponding to the markers; (b) metabolites which are produced directly or indirectly by 
15 polypeptides corresponding to the markers; (c) transcribed nucleic acids or fragments thereof having at least 
a portion with which the markers are substantially identical; and/or (c) transcribed nucleic acids or fragments 
thereof, wherein the nucleic acids hybridize with the markers. 

In an aspect of the invention, a method is provided for detecting ovarian cancer in a patient 
comprising detecting a plurality of kallikrein polypeptides, optionally in combination with CA125, in a 
20 sample from the patient wherein the method provides substantially increased sensitivity compared to 
methods using CA125 alone. In an embodiment, sensitivity is increased by at least 0.5%, 1%, 2%, 3%, 4%, 
5%, 10%, 15%, 20%, 25%, 30%, and 35% compared to using CA125 alone. 

In an embodiment, the invention provides a method for detecting a plurality of kallikrein markers, 
and optionally CA125, associated with ovarian cancer in a patient comprising: 
25 (a) obtaining a sample from a patient; 

(b) detecting or identifying in the sample kallikrein markers, optionally in combination with 
CA125, wherein the kallikrein markers comprise or are selected from the group consisting 
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(c) comparing the detected amounts with amounts detected for a standard. 

30 The term "detect" or "detecting" includes assaying, assessing, imaging or otherwise establishing the 

presence or absence of the target kallikrein and CA125 polypeptides or polynucleotides encoding the 
polypeptides, subunits thereof, or combinations of reagent bound targets, and the like, or assaying for, 
imaging, ascertaining, establishing, or otherwise determining one or more factual characteristics of ovarian 
cancer, metastasis, stage, or similar conditions. The term encompasses diagnostic, prognostic, and 

35 monitoring applications. The kallikrein polypeptides and CA125 can be detected individually, sequentially, 
or simultaneously. 

According to a method involving kallikrein markers optionally in combination with CA125, the 
levels in the sample of the kallikrein markers (2, 3, 4, 5, or 6) and optionally CA125, wherein the markers 
comprise or are selected from kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 



WO 2004/075713 



PCT/CA2004/000281 



-3- 

11, are compared with the normal levels of the kallikrein markers, and optionally CA125, in samples of the 
same type obtained from controls (e.g. samples from individuals not afflicted with ovarian cancer). 
Significantly different levels in the sample of the kallkrein markers (and optionally CA125) relative to the 
normal levels in a control is indicative of ovarian cancer. 
5 In an embodiment, the invention provides a method for diagnosing and monitoring ovarian 

carcinoma in a subject comprising detecting in a sample from the subject kallikrein markers, and optionally 
CA125, wherein the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. The kallikrein markers and CA125 can 
be detected using antibodies that bind to the kallikrein markers and CA125 or parts thereof. 
10 Thus, the invention provides a method of assessing whether a patient is afflicted with or has a pre- 

disposition for ovarian cancer, the method comprising comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a sample from the patient, wherein 
the kallikrein markers comprise kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11; and 
15 (b) normal levels of kallikrein markers, and optionally CA125, in samples of the same type 

obtained from control patients not afflicted with ovarian cancer, wherein significantly 
different levels of the kallikrein markers and optionally CA125, relative to the 
corresponding normal levels of the kallikrein markers, and optionally CA125, is an 
indication that the patient is afflicted with ovarian cancer. 
20 In an embodiment of a method of assessing whether a patient is afflicted with ovarian cancer (e.g. 

screening, detection of a recurrence, reflex testing), the method comprises comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 
25 (b) normal levels of the kallikrein markers, and optionally CA125, in a control non-ovarian 

cancer sample. 

A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 
patient sample and the normal levels is an indication that the patient is afflicted with ovarian cancer. 

The invention further relates to a method of assessing the efficacy of a therapy for inhibiting 
30 ovarian cancer in a patient. This method comprises comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a first sample obtained from the 
patient prior to providing at least a portion of the therapy to the patient, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11 ; and 
35 (b) levels of the kallikrein markers, and optionally CA125, in a second sample obtained from 

the patient following therapy. 
A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 
second sample, relative to the first sample, is an indication that the therapy is efficacious for inhibiting 
ovarian cancer. 
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The "therapy" may be any therapy for treating ovarian cancer including but not limited to 
chemotherapy, immunotherapy, gene therapy, radiation therapy, and surgical removal of tissue. Therefore, 
the method can be used to evaluate a patient before, during, and after therapy, for example, to evaluate the 
reduction in tumor burden. 

5 In an aspect, the invention provides a method for monitoring the progression of ovarian cancer in a 

patient, the method comprising: 

(a) detecting in a patient sample at a first time point, kallikrein markers, and optionally 
CA125, wherein the kallikrein markers comprise or are selected from the group consisting 
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

10 (b) repeating step (a) at a subsequent point in time; and 

(c) comparing the levels detected in (a) and (b), and therefrom monitoring the progression of 

ovarian cancer in the patient. 
In another aspect, the invention provides a method for assessing the aggressiveness or indolence of 
ovarian cancer (e.g. staging), the method comprising comparing: 
15 (a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 

kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(b) normal levels of the kallikrein markers, and optionally CA125 in a control sample. 

A significant difference between the levels in the sample and the normal levels is an indication that 
20 the cancer is aggressive or indolent. 

The invention provides a method for determining whether ovarian cancer has metastasized or is 
likely to metastasize in the future, the method comprising comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 

25 kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(b) normal levels (or non-metastatic levels) of the kallikrein markers, and optionally CA125, 
in a control sample. 

A significant difference between the levels in the patient sample and the normal levels is an 
indication that the cancer has metastasized or is likely to metastasize in the future. 
30 The invention also provides a method for assessing the potential efficacy of a test agent for 

inhibiting ovarian cancer in a patient, and a method of selecting an agent for inhibiting ovarian cancer in a 
patient. 

The invention further provides a method of inhibiting ovarian cancer in a patient comprising: 
(a) obtaining a sample comprising cancer cells from the patient; 
35 (b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots, 
wherein the kallikrein markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; 

(d) administering to the patient at least one of the test agents which alters the levels of the 
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kallikrein markers, and optionally CA125, in the aliquot containing that test agent, relative 
to other test agents. 

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test 
compound comprising: 

5 (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

compound; and 

(b) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots, 
wherein the markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. 
10 A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 

aliquot maintained in the presence of (or exposed to) the test compound relative to the aliquot maintained in 
the absence of the test compound, indicates that the test compound possesses ovarian carcinogenic potential. 

In preferred embodiments of the methods of the invention, the kallikrein markers comprise a 
plurality of kallikrein markers, for example, at least three, four, five, or six of the markers. In particular, a 
15 plurality of kallikrein markers may be selected from the group consisting of kallikrein 5, kallikrein 7, 
kallikrein 8, and kallikrein 10, from the group consisting of kallikrein 7, kallikrein 8, kallikrein 10, and 
kallikrein 11, or from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 
and kallikrein 11. 

Other methods of the invention employ one or more polynucleotides capable of hybridizing to 
20 polynucleotides encoding kallikrein markers, and optionally CA125. Methods for detecting polynucleotides 
encoding a kallikrein markers, and optionally CA125, can be used to monitor ovarian cancer by detecting the 
nucleic acids. 

Thus, the present invention relates to a method for diagnosing and monitoring ovarian cancer in a 
sample from a subject comprising isolating nucleic acids, preferably mRNA, from the sample; and detecting 
25 polynucleotides encoding kallikrein markers, and optionally CA125, in the sample. The presence of different 
levels of polynucleotides encoding kallikrein markers, and optionally CA125, in the sample compared to a 
standard or control is indicative of disease, disease stage, and/or prognosis, e.g. longer progression-free and 
overall survival. 

In an embodiment, the invention provides methods for determining the presence or absence of 
30 ovarian cancer in a subject comprising (a) contacting a sample obtained from the subject with 
oligonucleotides that hybridize to polynucleotides encoding kallikrein markers, and optionally CA125; and 
(b) detecting in the sample levels of nucleic acids that hybridize to the polynucleotides relative to a 
predetermined cut-off value, and therefrom determining the presence or absence of ovarian cancer in the 
subject. Within certain embodiments, mRNA is detected via polymerase chain reaction using, for example 
35 oligonucleotide primers that hybridize to polynucleotides encoding kallikrein markers, and optionally 
CA125, or complements of such polynucleotides. Within other embodiments, the amount of mRNA is 
detected using a hybridization technique, employing oligonucleotide probes that hybridize to polynucleotides 
encoding kallikrein markers, and optionally CA125, or complements of such polynucleotides. 

When using mRNA detection, the method may be carried out by combining isolated mRNA with 
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reagents to convert to cDNA according to standard methods; treating the converted cDNA with amplification 
reaction reagents (such as cDNA PCR reaction reagents) in a container along with an appropriate mixture of 
nucleic acid primers; reacting the contents of the container to produce amplification products; and analyzing 
the amplification products to detect the presence of polynucleotides encoding kallikrein markers, and 
5 optionally CA125, in the sample. For mRNA the analyzing step may be accomplished using Northern Blot 
analysis to detect the presence of polynucleotides encoding kallikrein markers, and optionally CA125. The 
analysis step may be further accomplished by quantitatively detecting the presence of polynucleotides 
encoding kallikrein markers, and optionally CA125, in the amplification product, and comparing the quantity 
of markers detected against a panel of expected values for the known presence or absence of the kallikrein 
10 markers in normal and malignant tissue derived using similar primers. 

In embodiments of the methods of the invention, a plurality (eg. three, four, five or six) 
polynucleotides encoding kallikrein polypeptides are employed. In particular, a plurality of polynucleotides 
encoding kallikrein markers may be selected from the group consisting of polynucleotides encoding (i) 
kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; (ii) polynucleotides encoding kallikrein 7, kallikrein 
15 8, kallikrein 10, and kallikrein 11; and (hi) polynucleotides encloding kallikrein 5, kallikrein 6, kallikrein7, 
kallikrein 8, kallikrein 10 and kallikrein 11. 

The invention also provides a diagnostic composition comprising a plurality of kallikrein 
polypeptides and optionally CA125 polypeptide, or polynucleotides encoding the polypeptides, or agents that 
bind to the polypeptides or polynucleotides. 
20 In an embodiment, the composition comprises probes that specifically hybridize to polynucleotides 

encoding kallikrein markers, and optionally CA125, or fragments thereof. In another embodiment a 
composition is provided comprising specific primer pairs capable of amplifying polynucleotides encoding 
kallikrein markers, and optionally CA125, using polymerase chain reaction methodologies. In a still further 
embodiment, the composition comprises agents that bind to kallikrein markers, and optionally CA125, (e.g. 
25 antibodies) or fragments thereof. Probes, primers, and agents can be labeled with detectable substances. 

In an aspect the invention provides an in vivo method comprising administering to a subject agents 
that have been constructed to target kallikrein markers, and optionally CA125. 

The invention therefore contemplates an in vivo method comprising administering to a mammal 
imaging agents that carry labels for imaging and that bind to kallikrein markers, and optionally CA125, and 
30 then imaging the mammal. 

Still further the invention relates to therapeutic applications for ovarian cancer employing kallikrein 
markers, and optionally CA125, nucleic acids encoding the polypeptides, and/or agents identified using 
methods of the invention. 

The invention also includes kits for carrying out methods of the invention. In an embodiment, the 
35 kit is for assessing whether a patient is afflicted with ovarian cancer and it comprises reagents for assessing 
kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or are selected from the 
group consising of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1. 

In another aspect the invention relates to a kit for assessing the suitability of each of a plurality of 
test compounds for inhibiting ovarian cancer in a patient. The kit comprises reagents for assessing kallikrein 
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markers, and optionally CA125, wherein the markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. The kit may also 
comprise a plurality of test agents or compounds. 

The invention contemplates a kit for assessing the presence of ovarian cancer cells, wherein the kit 
5 comprises antibodies specific for selected kallikrein markers, and optionally CA125, wherein the markers 
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11. 

Additionally the invention provides a kit for assessing the ovarian carcinogenic potential of a test 
compound. The kit comprises ovarian cells and reagents for assessing kallikrein markers, and optionally 
10 CA125, wherein the markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. 

In an aspect the invention provides a method of treating a patient afflicted with ovarian cancer 
comprising providing to cells of a patient antisense oligonucleotides complementary to polynucleotides 
encoding kallikrein markers, and optionally CA125, which are overexpressed in ovarian cancer. In an 
15 alternative method, expression of genes corresponding to kallikrein markers, and optionally CA125, which 
are underexpressed in ovarian cancer are increased. 

The invention relates to a method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer comprising inhibiting or increasing expression (or overexpression) of genes encoding 
kallikrein markers and optionally CA125, wherein the markers comprise or are selected from the group 
20 consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, that are 
either overexpressed or underexpressed, in ovarian cancer. 

Other objects, features and advantages of the present invention will become apparent from the 
following detailed description. It should be understood, however, that the detailed description and the 
specific examples while indicating preferred embodiments of the invention are given by way of illustration 
25 only, since various changes and modifications within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed description. 
DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in which 

Figure 1 is a graph showing hk5 concentration in serum from non-cancer and cancer patients. 
30 Figure 2 is a graph showing hk6 concentration in serum from non-cancer and cancer patients. 

Figure 3 is a graph showing hk7 concentration in serum from non-cancer and cancer patients. 

Figure 4 is a graph showing hk8 concentration in serum from non-cancer and cancer patients. 

Figure 5 is a graph showing hklO concentration in serum from non-cancer and cancer patients. 

Figure 6 is a graph showing hkll concentration in serum from non-cancer and cancer patients. 
35 Figure 7 is a graph showing CA125 concentration in serum from non-cancer and cancer patients. 

Figure 8 is a ROC curve illustrating the added value of using kallikreins and CA125 together in a 
multivariate function. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to newly discovered correlations between expression of certain markers and 
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ovarian cancer. The combinations of markers described herein may provide sensitive methods for detecting 
ovarian cancer. The levels of expression of a combination of markers described herein may correlate with the 
presence of ovarian cancer or a pre-malignant condition in a patient. Methods are provided for detecting the 
presence of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an ovarian 
5 cancer, the grade of an ovarian cancer, the benign or malignant nature of an ovarian cancer, the metastatic 
potential of an ovarian cancer, assessing the histological type of neoplasm associated with the ovarian 
cancer, the indolence or aggressiveness of the cancer, and other characteristics of ovarian cancer that are 
relevant to prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient. Methods are 
also provided for assessing the efficacy of one or more test agents for inhibiting ovarian cancer, assessing the 
10 efficacy of a therapy for ovarian cancer, monitoring the progression of ovarian cancer, selecting an agent or 
therapy for inhibiting ovarian cancer, treating a patient afflicted with ovarian cancer, inhibiting ovarian 
cancer in a patient, and assessing the carcinogenic potential of a test compound. 
Glossary 

The terms "sample", "biological sample", and the like, mean a material known or suspected of 
15 expressing or containing a plurality of kallikrien markers or polypeptides (2, 3, 4, 5, or 6 polypeptides), and 
optionally CA125 polypeptide, or polynucleotides encoding the polypeptides. The test sample can be used 
directly as obtained from the source or following a pretreatment to modify the character of the sample. The 
sample can be derived from any biological source, such as tissues, extracts, or cell cultures, including cells 
(e.g. tumor cells), cell lysates, and physiological fluids, such as, for example, whole blood, plasma, serum, 
20 saliva, ocular lens fluid, cerebral spinal fluid, sweat, urine, milk, ascites fluid, synovial fluid, peritoneal fluid 
and the like. The sample can be obtained from animals, preferably mammals, most preferably humans. The 
sample can be treated prior to use, such as preparing plasma from blood, diluting viscous fluids, and the like. 
Methods of treatment can involve filtration, distillation, extraction, concentration, inactivation of interfering 
components, the addition of reagents, and the like. Nucleic acids and polypeptides may be isolated from the 
25 samples and utilized in the methods of the invention. In a preferred embodiment, the sample is a serum 
sample. 

The term "subject" or "patient" refers to a warm-blooded animal such as a mammal, which is 
suspected of having ovarian cancer, or a condition, disease, or syndrome associated with ovarian cancer. 
Preferably, "subject" refers to a human. 

30 "CA125", "CA125 polypeptide", or "carbohydrate antigen 125" refers to a high-molecular weight 

mucin, which can be defined by its ability to bind to monoclonal antibody OC125. The CA125 protein core 
comprises a short cytoplasmic core tail, a transmembrane domain, and a large and heavily glycosylated 
extracellular domain dominated by a repeat domain of 156 amino acids rich in serine, threonine, and proline 
(Yin BW and Lloyd KO, J Biol Chem. 2001, 276:27371-27375; O'Brian TJ et al, Tumor Biol, 2001 22:348- 

35 366; and Hovig E. et al, Tumor Biol. 2001, 22:345-347). The sequence of CA125 is shown in GenBank 
Accession No. NP 078966, AAL65133 and AF414442 (SEQ ID NO. 1). The term includes the native- 
sequence polypeptides, isoforms, precursors and chimeric polypeptides. The term also includes the native 
sequence polypeptide, including polypeptide variants and polypeptides with substantial sequence identity 
(e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 
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99% sequence identity) to the sequence of GenBank Accession No.NP_078966 (SEQ ID NO. 1), and that 
preferably retain the immunogenic activity of the corresponding native sequence polypeptide. 

"Kallikrein polypeptides" or "kallikrein markers" comprise kallilkrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11. The term includes the native-sequence polypeptides, isoforms, 
5 precursors and chimeric polypeptides. The amino acid sequences for native kallikrein polypeptides employed 
in the present invention include the sequences found in GenBank for each polypeptide as shown in Table 1, 
and in SEQ ID NO: 3 (kallilkrein 5), NO.6 (kallikrein 6), NO. 10 (kallikrein 7), NO. 13 (kallikrein 8), NO. 
16 (kallikrein 10), and NOs. 19 and 20 (kallikrein 11), or a portion thereof. Other useful polypeptides are 
substantially identical to these sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 

10 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity), and preferably retain the immunogenic 
activity of the corresponding native-sequence kallikrein polypeptide. 

A "native-sequence polypeptide" comprises a polypeptide having the same amino acid sequence of 
a polypeptide derived from nature. Such native-sequence polypeptides can be isolated from nature or can be 
produced by recombinant or synthetic means. 

15 The term "native-sequence polypeptide" specifically encompasses naturally occurring truncated or 

secreted forms of a polypeptide, polypeptide variants including naturally occurring variant forms (e.g., 
alternatively spliced forms or splice variants), and naturally occurring allelic variants. 

The term "polypeptide variant" means a polypeptide having at least about 70-80%, preferably at 
least about 85%, more preferably at least about 90%, most preferably at least about 95% amino acid 

20 sequence identity with a native-sequence polypeptide, in particular having at least 70-80%, 85%, 90%, 95% 
amino acid sequence identity to the sequences identified in the GenBank Accession Nos. in Table 1 and 
Accession No. NP_078966, AF414442 and AAL65133 and shown in SEQ ID NOS: 1, 2, 3, 6, 10, 13, 16, 19 
and 20. Such variants include, for instance, polypeptides wherein one or more amino acid residues are 
added to, or deleted from, the N- or C-terminus of the full-length or mature sequences of SEQ ID NOS: 1, 2, 

25 3, 6, 10, 13, 16, 19 and 20, including variants from other species, but excludes a native-sequence 
polypeptide. 

An allelic variant may also be created by introducing substitutions, additions, or deletions into a 
nucleic acid encoding a native polypeptide sequence such that one or more amino acid substitutions, 
additions, or deletions are introduced into the encoded protein. Mutations may be introduced by standard 

30 methods, such as site-directed mutagenesis and PCR-mediated mutagenesis. In an embodiment, conservative 
substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which an animo acid residue is replaced with an amino acid residue with a similar 
side chain. Amino acids with similar side chains are known in the art and include amino acids with basic side 
chains (e.g. Lys, Arg, His), acidic side chains (e.g. Asp, Glu), uncharged polar side chains (e.g. Gly, Asp, 

35 Glu, Ser, Thr, Tyr and Cys), nonpolar side chains (e.g. Ala, Val, Leu, Iso, Pro, Trp), beta-branched side 
chains (e.g. Thr, Val, Iso), and aromatic side chains (e.g. Tyr, Phe, Trp, His). Mutations can also be 
introduced randomly along part or all of the native sequence, for example, by saturation mutagenesis. 
Following mutagenesis the variant polypeptide can be recombinantly expressed and the activity of the 
polypeptide may be determined. 
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Polypeptide variants include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of a native polypeptide which include fewer amino acids than the 
full length polypeptides. A portion of a polypeptide can be a polypeptide which is for example, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or more amino acids in length. Portions in which regions of a 
5 polypeptide are deleted can be prepared by recombinant techniques and can be evaluated for one or more 
functional activities such as the ability to form antibodies specific for a polypeptide. 

A naturally occurring allelic variant may contain conservative amino acid substitutions from the 
native polypeptide sequence or it may contain a substitution of an amino acid from a corresponding position 
in a CA125 or kallikrein polypeptide homolog, for example, the murine CA125 or kallikrein polypeptide. 
10 Percent identity of two amino acid sequences, or of two nucleic acid sequences identified herein is 

defined as the percentage of amino acid residues or nucleotides in a candidate sequence that are identical 
with the amino acid residues in a CA125 or kallikrein polypeptide or nucleic acid sequence, after aligning 
the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not 
considering any conservative substitutions as part of the sequence identity. Alignment for purposes of 
15 determining percent amino acid or nucleic acid sequence identity can be achieved in various conventional 
ways, for instance, using publicly available computer software including the GCG program package 
(Devereux J. et al., Nucleic Acids Research 12(1): 387, 1984); BLASTP, BLASTN, and FASTA (Atschul, 
S.F. et al. J. Molec. Biol. 215: 403-410, 1990). The BLAST X program is publicly available from NCBI and 
other sources (BLAST Manual, Altschul, S. et al. NCBI NLM NIH Bethesda, Md. 20894; Altschul, S. et al. 
20 J. Mol. Biol. 215: 403-410, 1990). Skilled artisans can determine appropriate parameters for measuring 
alignment, including any algorithms needed to achieve maximal alignment over the full length of the 
sequences being compared. Methods to determine identity and similarity are codified in publicly available 
computer programs. 

CA125 and kallikrien polypeptides include chimeric or fusion proteins. A "chimeric protein" or 
25 "fusion protein" comprises all or part (preferably biologically active) of a CA125 or kallikrein polypeptide 
operably linked to a heterologous polypeptide (i.e., a polypeptide other than the same CA125 or kallikrein 
polypeptide). Within the fusion protein, the term "operably linked" is intended to indicate that the CA125 or 
kallikrein polypeptide and the heterologous polypeptide are fused in-frame to each other. The heterologous 
polypeptide can be fused to the N-terminus or C-terminus of the CA125 or kallikrein polypeptide. A useful 
30 fusion protein is a GST fusion protein in which a kallikrein polypeptide is fused to the C-terminus of GST 
sequences. Another example of a fusion protein is an immunoglobulin fusion protein in which all or part of a 
CA125 or kallikrein polypeptide is fused to sequences derived from a member of the immunoglobulin 
protein family. Chimeric and fusion proteins can be produced by standard recombinant DNA techniques. 

CA125 and kallikrein polypeptides may be isolated from a variety of sources, such as from human 
35 tissue types or from another source, or prepared by recombinant or synthetic methods, or by any combination 
of these and similar techniques. 

"CA125 polynucleotides" or "polynucleotides encoding CA125" include nucleic acids that encode a 
native-sequence polypeptide, a polypeptide variant including a portion of a CA125 polypeptide, an isoform, 
precursor, and chimeric polypeptide. A nucleic acid sequence encoding native CA125 employed in the 
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present invention includes the nucleic acid sequence in GenBank Accession No. AF414442 and SEQ ID NO. 
2, or a fragment thereof. 

"Kallikrein polynucleotides" or "polynucleotides encoding kallikrein markers/polypeptides" refers 
to kallilkrein 5 nucleic acids (KLK5), kallikrein 6 nucleic acids (KLK6), kallikrein 7 nucleic acids (KLK7), 
5 kallikrein 8 nucleic acids (KLK8), kallikrein 10 nucleic acids (KLK10), and/or kallikrein 11 nucleic acids 
(KLK11). The term includes nucleic acids that encode a native-sequence polypeptide, a polypeptide variant 
including a portion of a kallikrein polypeptide, an isoform, precursor, and chimeric polypeptide. 

The polynucleotide sequences encoding native kallikrein polypeptides employed in the present 
invention include the nucleic acid sequences of the GenBank Accession Nos. identified in Table 1, and in 
10 SEQ ID NOs: 4 and 5 (KLK5), NOs. 7, 8, and 9 (KLK6), NOs. 1 1 and 12 (KLK 7), NOs. 14 and 15 (KLK8), 
NOs. 17 and 18 (KLK10), and NOs. 21 and 22 (KLK1 1), or a fragment thereof. 

Polynucleotides encoding kallikrien polypeptides and CA125 include nucleic acid sequences 
complementary to these polynucleotides, and polynucleotides that are substantially identical to these 
sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%,90%, 95%, 97%, 
15 98%, or 99% sequence identity). 

CA125 and kallikrein polynucleotides also include sequences which differ from a nucleic acid 
sequence of GenBank Accession Nos. identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 
17, 18, 21, and 22, due to degeneracy in the genetic code. As one example, DNA sequence polymorphisms 
within the nucleotide sequence of a CA125 or kallikrein polypeptide may result in silent mutations which do 
20 not affect the amino acid sequence. Variations in one or more nucleotides may exist among individuals 
within a population due to natural allelic variation. DNA sequence polymorphisms may also occur which 
lead to changes in the amino acid sequence of CA125 or a kallikrein polypeptide. 

CA125 and kallikrein polynucleotides also include nucleic acids that hybridize under stringent 
conditions, preferably high stringency conditions to a nucleic acid sequence of the GenBank Accession Nos. 
25 identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 17, 18, 21, and 22. Appropriate 
stringency conditions which promote DNA hybridization are known to those skilled in the art, or can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For 
example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 
50°C may be employed. The stringency may be selected based on the conditions used in the wash step. By 
30 way of example, the salt concentration in the wash step can be selected from a high stringency of about 0.2 x 
SSC at 50°C. In addition, the temperature in the wash step can be at high stringency conditions, at about 
65°C 

CA125 and kallikrein polynucleotides also include truncated nucleic acids or fragments and variant 
forms of the polynucleotides that arise by alternative splicing of an mRNA corresponding to a DNA. 
35 The CA125 and kallikrien polynucleotides are intended to include DNA and RNA (e.g. mRNA) and 

can be either double stranded or single stranded. A polynucleotide may, but need not, include additional 
coding or non-coding sequences, or it may, but need not, be linked to other molecules and/or carrier or 
support materials. The polynucleotides for use in the methods of the invention may be of any length suitable 
for a particular method. 
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A purality of kallikrein polypeptides or kallikrein polynucleotides are generally detected in the 
present invention. "Plurality" refers to 2, 3, 4, 5, or 6 kallikrein polypeptides or polynucleotides, in particular 
3, 4, 5, or 6, preferably 4, 5, or 6, more preferably 5 or 6 kallikrein polypeptides or polynucleotides. 

In an embodiment a plurality of kallikrein polypeptides is selected from the group consisting of 
5 kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 
8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11; or kallikrein 5, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. In 
another embodiment, a plurality of kallikrein polypeptides is selected from the group consisting of kallilkrein 
5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and kallikrein 11. 

10 In an embodiment, a pluraity of kallikrein polynucleotides is selected from the group consisting of 

KLK5, KLK7, and KLK8; KLK5, KLK8 and KLK10; KLK7, KLK8 and KLK10; KLK5, KLK7, KLK8, and 
KLK10; KLK7, KLK8, KLK10 and KLK11, or KLK5, KLK7, KLK8, KLK10 and KLK11. In another 
embodiment, a plurality of kallikrein polynucleotides is selected from the group consisting of KLK5, KLK6, 
KLK7, KLK8, KLK10, and KLK11. 

15 General Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of ovarian 
cancer involving kallikrein polypeptides, and optionally CA125 polypeptide, and polynucleotides encoding 
the polypeptides, and the identification of subjects with a predisposition to such disorders. Such methods 
may, for example, utilize polynucleotides encoding kallikrein polypeptides, and optionally CA125, and 

20 fragments thereof, and binding agents (e.g. antibodies aptamers) against kallikrein polypeptides, and 
optionally CA125 polypeptide, including peptide fragments. In particular, the polynucleotides and antibodies 
may be used, for example, for (1) the detection of either over- or under-expression of kallikrein 
polynucleotides, and optionally CA125, relative to a non-disorder state; and (2) the detection of either an 
over- or an under-abundance of kallikrein polypeptides, and optionally CA125, relative to a non-disorder 

25 state or the presence of modified (e.g., less than full length) kallikrein polypeptides, and optionally CA125, 
that correlate with a disorder state, or a progression toward a disorder state. 

The invention also contemplates a method for detecting ovarian cancer comprising producing a 
profile of levels of a plurality of kallikrein markers, and optionally CA125, in cells from a patient, wherein 
the markers are kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 

30 comparing the profile with a reference to identify a protein profile for the test cells indicative of disease. 

The methods described herein may be used to evaluate the probability of the presence of malignant 
or pre-malignant cells, for example, in a group of cells freshly removed from a host. Such methods can be 
used to detect tumors, quantitate their growth, and help in the diagnosis and prognosis of disease. The 
methods can be used to detect the presence of cancer metastasis, as well as confirm the absence or removal 

35 of all tumor tissue following surgery, cancer chemotherapy, and/or radiation therapy. They can further be 
used to monitor cancer chemotherapy and tumor reappearance. 

The methods described herein can be adapted for diagnosing and monitoring ovarian cancer by 
detecting a plurality of kallikrein polypeptides, and optionally CA125 polypeptide, or nucleic acids encoding 
the polypeptides in biological samples from a subject. These applications require that the amount of 
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polypeptides or nucleic acids quantitated in a sample from a subject being tested be compared to a 
predetermined standard. The standard may correspond to levels quantitated for another sample or an earlier 
sample from the subject, or levels quantitated for a control sample. Levels for control samples from healthy 
subjects or ovarian cancer subjects may be established by prospective and/or retrospective statistical studies. 
5 Healthy or normal subjects who have no clinically evident disease or abnormalities may be selected for 
statistical studies. Diagnosis may be made by a finding of statistically different levels of a plurality of 
kallikrein polypeptides, and optionally CA125, or nucleic acids encoding same, compared to a control 
sample or previous levels quantitated for the same subject. A "significant difference" in levels of kallikrein 
markers or polynucleotides encoding the kallikrein markers in a patient sample compared to a control or 

10 standard (e.g. normal levels or levels in other samples from a patient) may represent levels that are higher or 
lower than the standard error of the detection assay, preferably the levels are at least about 1.5, 2, 3, 4, 5, or 6 
times higher or lower, respectively, than the control or standard. The difference in levels of markers or 
polynucleotides may be a "statistically significant difference" 
Nucleic Acid Methods/Assays 

15 As noted herein an ovarian cancer may be detected based on the levels of polynucleoitdes encoding 

kallikrein polypeptides, and optionally CA125, in a sample. Techniques for detecting polynucleotides such 
as polymerase chain reaction (PCR) and hybridization assays are well known in the art. 

Nucleotide probes for use in the detection of nucleic acid sequences in samples may be constructed 
using conventional methods known in the art. Suitable probes may be based on nucleic acid sequences 

20 encoding at least 5 sequential amino acids from regions of nucleic acids encoding kallikrein polypeptides, 
and optionally CA125, preferably they comprise 15 to 40 nucleotides. A nucleotide probe may be labeled 
with a detectable substance such as a radioactive label that provides for an adequate signal and has sufficient 
half-life such as 32 P, 3 H, 14 C or the like. Other detectable substances that may be used include antigens that 
are recognized by a specific labeled antibody, fluorescent compounds, enzymes, antibodies specific for a 

25 labeled antigen, and luminescent compounds. An appropriate label may be selected having regard to the rate 
of hybridization and binding of the probe to the nucleotide to be detected and the amount of nucleotide 
available for hybridization. Labeled probes may be hybridized to nucleic acids on solid supports such as 
nitrocellulose filters or nylon membranes as generally described in Sambrook et al, 1989, Molecular 
Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes may be used to detect polynucleoitides 

30 encoding kallikrein polypeptides, and optionally CA125, preferably in human cells. The nucleotide probes 
may also be useful in the diagnosis of ovarian cancer involving polynucleoitides encoding kallikrein 
polypeptides, and optionally CA125, in monitoring the progression of such disorder; or monitoring a 
therapeutic treatment. 

Probes may be used in hybridization techniques to detect nucleic acids encoding a plurality of 
35 kallikrein polypeptides, and optionally CA125. The technique generally involves contacting and incubating 
nucleic acids (e.g. recombinant DNA molecules, cloned genes) obtained from a sample from a patient or 
other cellular source with probes under conditions favorable for the specific annealing of the probes to 
complementary sequences in the nucleic acids. After incubation, the non-annealed nucleic acids are 
removed, and the presence of nucleic acids that have hybridized to the probe if any are detected. 
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The detection of polynucleotides encoding kallikrein polypeptides and optionally CA125, may 
involve the amplification of specific gene sequences using an amplification method such as polymerase 
chain reaction (PCR), followed by the analysis of the amplified molecules using techniques known to those 
skilled in the art. Suitable primers can be routinely designed by one of skill in the art. 
5 By way of example, oligonucleotide primers may be employed in a PCR based assay to amplify a 

portion of nucleic acids encoding each of a plurality of kallikrein polypeptides, and optionally CA125, 
derived from a sample, wherein the oligonucleotide primers are specific for (i.e. hybridize to) 
polynucleotides encoding each of the plurality of kallikrein polypeptides, and optionally CA125. The 
amplified cDNA is then separated and detected using techniques well known in the art, such as gel 
10 electrophoresis. 

In order to maximize hybridization under assay conditions, primers and probes employed in the 
methods of the invention generally have at least about 60%, preferably at least about 75% and more 
preferably at least about 90% identity to a portion of polynucleotides encoding a plurality of kallikrein 
polypeptides, and CA125. The primers and probes may be at least 10 nucleotides, and preferably at least 20 
15 nucleotides in length. In an embodiment the primers and probes are at least about 10-40 nucleotides in 
length. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of expression of polynucleotides encoding kallikrein polypeptides, and optionally 
CA125. For example, RNA may be isolated from a cell type or tissue known to express these 
20 polynucleotides and tested utilizing the hybridization (e.g. standard Northern analyses) or PCR techniques 
referred to herein. 

The primers and probes may be used in the above- described methods in situ i.e directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

In an aspect of the invention, a method is provided employing reverse transcriptase-polymerase 
25 chain reaction (RT-PCR), in which PCR is applied in combination with reverse transcription. Generally, 
RNA is extracted from a sample tissue using standard techniques (for example, guanidine isothiocyanate 
extraction as described by Chomcynski and Sacchi, Anal. Biochem. 162:156-159, 1987) and is reverse 
transcribed to produce cDNA. The cDNA is used as a template for a polymerase chain reaction. The cDNA 
is hybridized to sets of primers specifically designed against each of a plurality of kallikrein polynucleotide 
30 sequences, and optionally CA125. Once the primer and template have annealed a DNA polymerase is 
employed to extend from the primer, to synthesize a copy of the template. The DNA strands are denatured, 
and the procedure is repeated many times until sufficient DNA is generated to allow visualization by 
ethidium bromide staining and agarose gel electrophoresis. 

Amplification may be performed on samples obtained from a subject with suspected ovarian cancer 
35 and an individual who is not afflicted with ovarian cancer. The reaction may be performed on several 
dilutions of cDNA spanning at least two orders of magnitude. A statistically significant difference in 
expression in several dilutions of the subject sample as compared to the same dilutions of the non- cancerous 
sample may be considered positive for the presence of ovarian cancer. 

Oligonucleotides or longer fragments derived from polynucleotides encoding each of a plurality of 
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kallikrein polypeptides and optionally CA125, may be used as targets in a microarray. The microarray can be 
used to simultaneously monitor the expression levels of large numbers of genes. The information from the 
microarray may be used to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

5 The preparation, use, and analysis of microarrays are well known to a person skilled in the art. (See, 

for example, Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, et al. (1996) Proc. Natl. Acad. 
Sci. 93:10614-10619; Baldeschweiler et al. (1995), PCT Application W095/251116; Shalon, D. et al. (I 995) 
PCT application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, 
M. J. et al. (1997) U.S. Pat. No. 5,605,662.) 

10 Thus, the invention also includes an array comprising a plurality of polynucleotides encoding 

kallikrein marker(s), and optionally CA125 polynucleotides. The array can be used to assay expression of 
kallikrein polynucleotides, and optionally CA125 polynucleotides in the array. The invention allows the 
quantitation of expression of a plurality of kallikrein polynucleotides, and optionally CA125 polynucleotides. 

In an embodiment, the array can be used to monitor the time course of expression of a plurality of 

15 kallikrein polynucleotides, and optionally CA125 polynucleotides, in the array. This can occur in various 
biological contexts such as tumor progression. 

The array is also useful for ascertaining differential expression patterns of a plurality of kallikrein 
polynucleotides and optionally CA125 polynucleotides, in normal and abnormal cells. This provides a 
battery of polynucleotides that could serve as molecular targets for diagnosis or therapeutic intervention. 

20 Protein Methods 

Binding agents specific for a plurality of kallikrein markers and CA125 may be used for a variety of 
diagnostic and assay applications. There are a variety of assay formats known to the skilled artisan for using 
a binding agent to detect a target molecule in a sample. (For example, see Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988). In general, the presence or absence of an ovarian 
25 cancer in a subject may be determined by (a) contacting a sample from the subject with binding agents for a 
plurality of kallikrein polypeptides, and optionally CA125; (b) detecting in the sample levels of polypeptides 
that bind to the binding agents; and (c) comparing the levels of polypeptides with a predetermined standard 
or cut- off value. 

"Binding agent" refers to a substance such as a polypeptide or antibody that specifically binds to a 
30 kallikrein or CA125 polypeptide. A substance "specifically binds" to a polypeptide if it reacts at a detectable 
level with the kallikrein or CA125 polypeptide, and does not react detectably with peptides containing 
unrelated sequences or sequences of different polypeptides. Binding properties may be assessed using an 
ELISA, which may be readily performed by those skilled in the art (see for example, Newton et al , Develop. 
Dynamics 197: 1-13, 1993). 

35 A binding agent may be a ribosome, with or without a peptide component, an aptamer, an RNA 

molecule, or a polypeptide. A binding agent may be a polypeptide that comprises a kallikrein polypeptide or 
CA125 polypeptide sequence, a peptide variant thereof, or a non-peptide mimetic of such a sequence. By 
way of example a kallikrein polypeptide sequence may be a peptide portion of a kallikrein polypeptide that is 
capable of modulating a function mediated by the kallikrein polypeptide. 
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An aptamer includes a DNA or RNA molecule that binds to polynucleotides and polypeptides. An 
aptamer that binds to a polypeptide (or binding domain) of a kallikrein polypeptide or a polynucleotide 
encoding a kallikrein polypeptide can be produced using conventional techniques, without undue 
experimentation. [For example, see the following publications describing in vitro selection of aptamers: Klug 
5 et al., Mol. Biol. Reports 20:97-107 (1994); Wallis et al., Chem. Biol. 2:543-552 (1995); Ellington, Curr. 
Biol. 4:427-429 (1994); Lato et al., Chem. Biol. 2:291-303 (1995); Conrad et al., Mol. Div. 1:69-78 (1995); 
and Uphoff et al., Curr. Opin. Struct. Biol. 6:281-287 (1996)]. 

In certain other preferred embodiments, the binding agent is an antibody. 

In an aspect the present invention provides a diagnostic method for monitoring or diagnosing 
10 ovarian cancer in a subject by quantitating a plurality of kallikrein polypeptides, and optionally CA125, in a 
biological sample from the subject comprising reacting the sample with antibodies specific for a plurality of 
kallikrein polypeptides, and optionally CA125, which are directly or indirectly labelled with detectable 
substances, and detecting the detectable substances. 

In an aspect of the invention, a method for detecting ovarian cancer is provided comprising: 
15 (a) obtaining a sample suspected of containing a plurality of kallikrein polypeptides, and 

optionally CA125, wherein the kallikrein polypeptides comprise or are selected from the 
group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and 
kallikrein 11; 

(b) contacting the sample with antibodies that specifically bind to the plurality of kallikrein 
20 polypeptides, and optionally CA125, under conditions effective to bind the antibodies and 

form complexes; 

(c) measuring the amount of kallikrein polypeptides, and optionally CA125, present in the 
sample by quantitating the amount of the complexes; and 

(d) comparing the amount of kallikrein polypeptides, and optionally CA125, present in the 
25 samples with the amount of polypeptides in a control, wherein a change or significant 

difference in the amount of polypeptides in the sample compared with the amount in the 
control is indicative of ovarian cancer. 
In an embodiment, the invention contemplates a method for monitoring the progression of ovarian 
cancer in an individual, comprising: 
30 (a) contacting antibodies which bind to each of a plurality of kallikrein polypeptides, and 

optionally CA125, with a sample from the individual so as to form binary complexes 
comprising each of the antibodies and polypeptides in the sample; 

(b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 

35 (d) comparing the result of step (b) with the result of step (c), wherein a difference in the 

amount of complex formation is indicative of the stage and/or progression of the ovarian 
cancer in said individual. 

The amount of complexes may also be compared to a value representative of the amount of the 
complexes from an individual not at risk of, or afflicted with, ovarian cancer at different stages. 
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Thus, antibodies specifically reactive with each of a plurality of kallikrein polypeptides, and 
CA125, or derivatives, such as enzyme conjugates or labeled derivatives, may be used to detect a plurality of 
kallikrein polypeptides, and optionally CA125, in various samples (e.g. biological materials). They may be 
used as diagnostic or prognostic reagents and they may be used to detect abnormalities in the levels of 
5 expression of a plurality of kallikrein polypeptides, and optionally CA125, or abnormalities in the structure, 
and/or temporal, tissue, cellular, or subcellular location of a plurality of kallikrein polypeptides, and 
optionally CA125. Antibodies may also be used to screen potentially therapeutic compounds in vitro to 
determine their effects on ovarian cancer involving a plurality of kallikrein polypeptides, and optionally 
CA125, and other conditions. In vitro immunoassays may also be used to assess or monitor the efficacy of 
10 particular therapies. 

Antibodies may be used in any known immunoassays that rely on the binding interaction between 
antigenic determinants of a plurality of kallikrein polypeptides, and optionally CA125, and the antibodies. 
Examples of such assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA), 
immunofluorescence, immunoprecipitation, latex agglutination, hemagglutination, and histochemical tests. 

15 These terms are well understood by those skilled in the art. A person skilled in the art will know, or can 
readily discern, other immunoassay formats without undue experimentation. 

In particular, the antibodies may be used in immunohistochemical analyses, for example, at the 
cellular and sub-subcellular level, to detect a plurality of kallikrein polypeptides, and optionally CA125, to 
localize them to particular ovarian tumor cells and tissues, and to specific subcellular locations, and to 

20 quantitate the level of expression. 

Antibodies for use in the present invention include monoclonal or polyclonal antibodies, 
immunologically active fragments (e.g. a Fab or (Fab) 2 fragments), antibody heavy chains, humanized 
antibodies, antibody light chains, genetically engineered single chain F v molecules (Ladner et al, U.S. Pat. 
No. 4,946,778), chimeric antibodies, for example, antibodies which contain the binding specificity of murine 

25 antibodies, but in which the remaining portions are of human origin, or derivatives, such as enzyme 
conjugates or labeled derivatives. 

Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, may be 
prepared using methods known to those skilled in the art. Isolated native or recombinant kallikrein 
polypeptides or CA125 may be utilized to prepare antibodies. See, for example, Kohler et al. (1975) Nature 

30 256:495-497; Kozbor et al. (1985) J. Immunol Methods 81:31-42; Cote et al. (1983) Proc Natl Acad Sci 
80:2026-2030; and Cole et al. (1984) Mol Cell Biol 62:109-120 for the preparation of monoclonal 
antibodies; Huse et al. (1989) Science 246:1275-1281 for the preparation of monoclonal Fab fragments; and, 
Pound (1998) Immunochemical Protocols, Humana Press, Totowa, N.J for the preparation of phagemid or B- 
lymphocyte immunoglobulin libraries to identify antibodies. The antibodies specific for kallikrein 

35 polypeptides or CA125 used in the methods of the invention may also be obtained from scientific or 
commercial sources. 

In an embodiment of the invention, antibodies are reactive against kallikrein polypeptides or CA125 
if they bind with a K a of greater than or equal to 10" 7 M. 

Antibodies that bind to kallikrein polypeptides or CA125 may be labelled with a detectable 
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substance and localised in biological samples based upon the presence of the detectable substance. Examples 
of detectable substances include, but are not limited to, the following: radioisotopes (e.g., 3 H, 14 C, 35 S, 125 I, 
131 I), fluorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), luminescent labels such as luminol, 
enzymatic labels (e.g., horseradish peroxidase, beta-galactosidase, luciferase, alkaline phosphatase, 
5 acetylcholinesterase), biotinyl groups (which can be detected by marked avidin e.g., streptavidin containing a 
fluorescent marker or enzymatic activity that can be detected by optical or colorimetric methods), and 
predetermined polypeptide epitopes recognized by a secondary reporter (e.g., leucine zipper pair sequences, 
binding sites for secondary antibodies, metal binding domains, epitope tags). In some embodiments, labels 
are attached via spacer arms of various lengths to reduce potential steric hindrance. Antibodies may also be 
10 coupled to electron dense substances, such as ferritin or colloidal gold, which are readily visualised by 
electron microscopy. 

Indirect methods may also be employed in which the primary antigen-antibody reaction is amplified 
by the introduction of a second antibody, having specificity for the antibody reactive against a kallikrein 
polypeptide or CA125. The second antibody may be labeled with a detectable substance to detect the 
15 primary antigen-antibody reaction. By way of example, if the antibody having specificity against a kallikrein 
polypeptide is a rabbit IgG antibody, the second antibody may be goat anti-rabbit gamma-globulin labelled 
with a detectable substance as described herein. 

Methods for conjugating or labelling the antibodies discussed above may be readily accomplished 
by one of ordinary skill in the art. (See for example Inman, Methods In Enzymology, Vol. 34, Affinity 
20 Techniques, Enzyme Purification: Part B, Jakoby and Wichek (eds.), Academic Press, New York, p. 30, 
1974; and Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical Applications, "Anal. Biochem. 
171:1-32, 1988 re methods for conjugating or labelling the antibodies with enzyme or ligand binding 
partner). 

Cytochemical techniques known in the art for localizing antigens using light and electron 
25 microscopy may be used to detect a plurality of kallikrein polypeptides, and optionally CA125. Generally, 
antibodies may be labeled with detectable substances and kallikrein polypeptides, and optionally CA125, 
may be localised in tissues and cells based upon the presence of the detectable substance. 

In the context of the methods of the invention, the sample, binding agents (e.g. antibodies) for a 
plurality of kallikrein polypeptides, and CA125 may be immobilized on a carrier or support. Examples of 
30 suitable carriers or supports are agarose, cellulose, nitrocellulose, dextran, Sephadex, Sepharose, liposomes, 
carboxymethyl cellulose, polyacrylamides, polystyrene, gabbros, filter paper, magnetite, ion-exchange resin, 
plastic film, plastic tube, glass, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid copolymer, 
ethylene-maleic acid copolymer, nylon, silk, etc. The support material may have any possible configuration 
including spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external surface of 
35 a rod), or flat (e.g. sheet, test strip). Thus, the carrier may be in the shape of, for example, a tube, test plate, 
well, beads, disc, sphere, etc. The immobilized material may be prepared by reacting the material with a 
suitable insoluble carrier using known chemical or physical methods, for example, cyanogen bromide 
coupling. Binding agents (e.g. antibodies) may be indirectly immobilized using second binding agents 
specific for the first binding agent. For example, mouse antibodies specific for a kallikrein polypeptide may 
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be immobilized using sheep anti-mouse IgG Fc fragment specific antibody coated on the carrier or support. 

Where radioactive labels are used as a detectable substance, a plurality of kallikrein polypeptides, 
and optionally CA125, may be localized by radioautography. The results of radioautography may be 
quantitated by determining the density of particles in the radioautographs by various optical methods, or by 
5 counting the grains. 

Time-resolved fluorometry may be used to detect a signal. For example, the method described in 
Christopoulos TK and Diamandis EP Anal Chem 1992:64:342-346 may be used with a conventional time- 
resolved fluorometer. 

Therefore, in accordance with an embodiment of the invention, a method is provided wherein 

10 antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, are labelled with 
enzymes, substrates for the enzymes are added wherein the substrates are selected so that the substrates, or a 
reaction product of the enzymes and substrates, form fluorescent complexes with lanthanide metals. 
Lanthanide metals are added and the plurality of kallikrein polypeptides, and optionally CA125, are 
quantitated in the sample by measuring fluorescence of the fluorescent complexes. Antibodies specific for 

15 CA125 and each of a plurality of kallikrein polypeptides may be directly or indirectly labelled with enzymes. 
Enzymes are selected based on the ability of a substrate of the enzyme, or a reaction product of the enzyme 
and substrate, to complex with lanthanide metals such as europium and terbium. Examples of suitable 
enzymes include alkaline phosphatase and P-galactosidase. 

Examples of enzymes and substrates for enzymes that provide such fluorescent complexes are 

20 described in U.S. Patent No. 5,312,922 to Diamandis. By way of example, when the antibody is directly or 
indirectly labelled with alkaline phosphatase the substrate employed in the method may be 4- 
methylumbelliferyl phosphate, 5-fluorosalicyl phosphate, or diflunisal phosphate. The fluorescence intensity 
of the complexes is typically measured using a time-resolved fluorometer e.g. a CyberFluor 615 
Imunoanalyzer (Nordion International, Kanata, Ontario). 

25 Antibodies specific for a plurality of kallikrein polypeptides and CA125 may also be indirectly 

labelled with enzymes. For example, an antibody may be conjugated to one partner of a ligand binding pair, 
and the enzyme may be coupled to the other partner of the ligand binding pair. Representative examples 
include avidin-biotin, and riboflavin-riboflavin binding protein. In another embodiment, antibodies specific 
for the anti-kallikrein antibodies or anti- CA125 antibodies are labeled with an enzyme. 

30 In accordance with an embodiment, the present invention provides means for determining a 

plurality of kallikrein polypeptides, and optionally CA125, in a sample, in particular a serum sample, by 
measuring a plurality of kallikrein polypeptides, and optionally CA125, by immunoassay. It will be evident 
to a skilled artisan that a variety of immunoassay methods can be used to measure a plurality of kallikrein 
polypeptides and CA125 in serum. In general, an immunoassay method may be competitive or 

35 noncompetitive. Competitive methods typically employ immobilized or immobilizable antibodies to each of 
a plurality of kallikrein polypeptides, and optionally CA125, and a labeled form of each of a plurality of 
kallikrein polypeptides, and optionally CA125. Kallikrein polypeptides and CA125 and labeled kallikrein 
polypeptides and CA125 compete for binding to anti-kallikrein antibodies and anti-CA125 antibodies. After 
separation of the resulting labeled kallikrein polypeptides and CA125 that have become bound to anti- 
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kallikrein polypeptides and anti- CA125 (bound fraction) from that which has remained unbound (unbound 
fraction), the amount of the label in either bound or unbound fraction is measured and may be correlated 
with the amount of kallikrein polypeptides, and optionally CA125, in the test sample in any conventional 
manner, e.g., by comparison to a standard curve. 
5 In an aspect, a non-competitive method is used for the determination of a plurality of kallikrein 

polypeptides, and optionally CA125, with the most common method being the "sandwich" method. In this 
assay, two types of antibodies specific for each of a plurality of kallikrein polypeptides, and optionally 
CA125 are employed. One type of antibody is directly or indirectly labeled (sometimes referred to as the 
"detection antibody") and the other is immobilized or immobilizable (sometimes referred to as the "capture 
10 antibody"). The capture and detection antibodies can be contacted simultaneously or sequentially with a test 
sample. Sequential methods can be accomplished by incubating capture antibodies with the sample, and 
adding the detection antibodies at a predetermined time thereafter (sometimes referred to as the "forward" 
method); or the detection antibodies can be incubated with the sample first and then the capture antibodies 
added (sometimes referred to as the "reverse" method). After the necessary incubation(s) have occurred, to 
15 complete the assay, the capture antibodies are separated from the liquid test mixture, and labels are measured 
in at least a portion of the separated capture antibody phase or the remainder of the liquid test mixture. 
Generally the labels are measured in the capture antibody phase since it comprises kallikrein polypeptides, 
and optionally CA125, bound by ("sandwiched" between) the capture and detection antibodies. In an 
embodiment, the label may be measured without separating the capture antibodies and liquid test mixture. 
20 In a typical two-site immunometric assay for a plurality of kallikrein polypeptides, and optionally 

CA125, one or both of the capture and detection antibodies are polyclonal antibodies or one or both of the 
capture and detection antibodies are monoclonal antibodies (i.e. polyclonal/polyclonal, 
monoclonal/monoclonal, or monoclonal/polyclonal). The labels used with the detection antibodies can be 
selected from any of those known conventionally in the art. The labels may be an enzyme or a 
25 chemiluminescent moiety, but it can also be a radioactive isotope, a fluorophor, a detectable ligand (e.g., 
detectable by a secondary binding by a labeled binding partner for the ligand), and the like. Preferably 
antibodies are labelled with enzymes which are detected by adding substrates that are selected so that a 
reaction product of the enzymes and substrates forms fluorescent complexes. Capture antibodies may be 
selected so that they provide a means for being separated from the remainder of the test mixture. 
30 Accordingly, the capture antibodies can be introduced to the assay in an already immobilized or insoluble 
form, or can be in an immobilizable form, that is, a form which enables immobilization to be accomplished 
subsequent to introduction of the capture antibodies to the assay. An immobilized capture antibody may 
comprise an antibody covalently or noncovalently attached to a solid phase such as a magnetic particle, a 
latex particle, a microtiter plate well, a bead, a cuvette, or other reaction vessel. An example of an 
35 immobilizable capture antibody is antibody which has been chemically modified with a ligand moiety, e.g., a 
hapten, biotin, or the like, and which can be subsequently immobilized by contact with an immobilized form 
of a binding partner for the ligand, e.g., an antibody, avidin, or the like. In an embodiment, a capture 
antibody may be immobilized using a species specific antibody for the capture antibody that is bound to the 
solid phase. 
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A particular sandwich immunoassay method of the invention employs two types of antibodies, first 
antibodies are reactive against each of a plurality of kallikrein polypeptides, and optionally CA125, and 
second antibodies having specificity against antibodies reactive against each of a plurality of kallikrein 
polypeptides, and optionally CA125, labelled with enzymatic labels, and fluorogenic substrates for the 
5 enzymes. An enzyme may be alkaline phosphatase (ALP) and the substrate is 5-fluorosalicyl phosphate. 
ALP cleaves phosphate out of the fluorogenic substrate, 5-fluorosalicyl phosphate, to produce 5- 
fluorosalicylic acid (FSA). 5-Fluorosalicylic acid can then form a highly fluorescent ternary complex of the 
form FSA~Tb(3+)-EDTA, which can be quantified by measuring the Tb3+ fluorescence in a time-resolved 
mode. Fluorescence intensity is measured using a time-resolved fluorometer as described herein. 
10 The above-described immunoassay methods and formats are intended to be exemplary and are not 

limiting. 

Computer Systems 

Computer readable media comprising a plurality of kallikrein markers, and optionally CA125, is 
also provided. "Computer readable media" refers to any medium that can be read and accessed directly by a 
15 computer, including but not limited to magnetic storage media, such as floppy discs, hard disc storage 
medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. Thus, the invention 
contemplates computer readable medium having recorded thereon markers identified for patients and 
controls. 

20 "Recorded" refers to a process for storing information on computer readable medium. The skilled 

artisan can readily adopt any of the presently known methods for recording information on computer 
readable medium to generate manufactures comprising information on a plurality of kallikrein markers, and 
optionally CA125. 

A variety of data processor programs and formats can be used to store information on a plurality of 
25 kallikrein markers, and optionally CA125, on computer readable medium. For example, the information can 
be represented in a word processing text file, formatted in commercially- available software such as 
WordPerfect and MicroSoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. Any number of dataprocessor structuring formats (e.g., 
text file or database) may be adapted in order to obtain computer readable medium having recorded thereon 
30 the marker information. 

By providing the marker information in computer readable form, one can routinely access the 
information for a variety of purposes. For example, one skilled in the art can use the information in computer 
readable form to compare marker information obtained during or following therapy with the information 
stored within the data storage means. 
35 The invention provides a medium for holding instructions for performing a method for determining 

whether a patient has ovarian cancer or a pre-disposition to ovarian cancer, comprising determining the 
presence or absence of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same, and based on the presence or absence of the plurality of kallikrein markers, optionally CA125, and/or 
polynucleotides encoding same, determining whether the patient has ovarian cancer or a pre-disposition to 
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ovarian cancer, and optionally recommending treatment for the ovarian cancer or pre-ovarian cancer 
condition. 

The invention also provides in an electronic system and/or in a network, a method for determining 
whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of 
5 kallikrein markers, and optionally CA125, and/or polynucleotides encoding same, comprising determining 
the presence or absence of a plurality of kallikrein markers, and optionally CA125, and/or polynucleotides 
encoding same, and based on the presence or absence of the plurality of kallikrein markers, and optionally 
CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian cancer or a pre- 
disposition to ovarian cancer, and optionally recommending treatment for the ovarian cancer or pre-ovarian 
10 cancer condition. 

The invention further provides in a network, a method for determining whether a subject has 
ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of kallikrein markers, 
optionally CA125 and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information 
on the subject and information on a plurality of kallikrein markers, optionally CA125 and/or polynucleotides 

15 encoding same associated with samples from the subject; (b) acquiring information from the network 
corresponding to the plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same; and (c) based on the phenotypic information and information on the plurality of kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian 
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian 

20 cancer or pre-ovarian cancer condition. 

The invention still further provides a system for identifying selected records that identify an ovarian 
cancer cell. A system of the invention generally comprises a digital computer; a database server coupled to 
the computer; a database coupled to the database server having data stored therein, the data comprising 
records of data comprising a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 

25 encoding same, and a code mechanism for applying queries based upon a desired selection criteria to the data 
file in the database to produce reports of records which match the desired selection criteria. 

In an aspect of the invention a method is provided for detecting an ovarian cancer cell using a 
computer having a processor, memory, display, and input/output devices, the method comprising the steps 
of: 

30 (a) creating records of a plurality of kallikrein markers, optionally CA125, and/or 



polynucleotides encoding same, isolated from a sample suspected of containing an ovarian 
cancer cell; 



GO 



providing a database comprising records of data comprising a plurality of kallikrein 
markers, optionally CA125, wherein the markers are kallikrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11, and/or comprising polynucleotides encoding 
same; and 



35 



(c) 



using a code mechanism for applying queries based upon a desired selection criteria to the 
data file in the database to produce reports of records of step (a) which provide a match of 
the desired selection criteria of the database of step (b) the presence of a match being a 



WO 2004/075713 



PCT/CA2004/000281 



-23- 

positive indication that the markers of step (a) have been isolated from a cell that is an 
ovarian cancer cell. 

The invention contemplates a business method for determining whether a subject has ovarian cancer 
or a pre-disposition to ovarian cancer associated with a plurality of kallikrein markers, optionally CA125, 
5 and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information on the subject and 
information on a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same, 
associated with samples from the subject; (b) acquiring information from a network corresponding to the 
plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same; and (c) based on 
the phenotypic information, information on a plurality of kallikrein markers, optionally CA125, and/or 
10 polynucleotides encoding same, and acquired information, determining whether the subject has ovarian 
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian 
cancer or pre-ovarian cancer condition. 
Imaging Methods 

Antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, may 
15 also be used in imaging methodologies in the management of ovarian cancer. The invention provides a 
method for imaging tumors associated with a plurality of kallikrein polypeptides, and optionally CA125. 

In an embodiment the method is an in vivo method and a subject or patient is administered imaging 
agents that carry imaging labels and are capable of targeting or binding to each of a plurality of kallikrein 
polypeptides, and optionally CA125. In the method each imaging agent is labeled so that it can be 
20 distinguished during the imaging. The imaging agents are allowed to incubate in vivo and bind to the 
plurality of kallikrein polypeptides, and optionally CA125, associated with ovarian tumors. The presence of 
label is localized to the ovarian cancer, and the localized label is detected using imaging devices known to 
those skilled in the art. 

The imaging agents may be antibodies or chemical entities that recognize the plurality of kallikrein 
25 polypeptides, and optionally CA125. In an aspect of the invention an imaging agent is a polyclonal antibody 
or monoclonal antibody, or fragments thereof, or constructs thereof including but not limited to, single chain 
antibodies, bifunctional antibodies, molecular recognition units, and peptides or entities that mimic peptides. 
The antibodies specific for kallikrein polypeptides and CA125 used in the methods of the invention may be 
obtained from scientific or commercial sources, or isolated native or recombinant kallikrein and CA125 
30 polypeptides may be utilized to prepare antibodies etc as described herein. 

An imaging agent may be a peptide that mimics the epitope for an antibody specific for kallikrein 
polypeptide or CA125 and binds to kallikrein polypeptide or CA125. The peptide may be produced on a 
commercial synthesizer using conventional solid phase chemistry. By way of example, a peptide may be 
prepared that includes either tyrosine, lysine, or phenylalanine to which N 2 S 2 chelate is complexed (See U.S. 
35 Patent No. 4,897,255). The anti-kallikrein peptide conjugate is then combined with a radiolabel (e.g. sodium 
99m Tc pertechnetate or sodium 188 Re perrhenate) and it may be used to locate a tumor producing a plurality 
of kallikrein polypeptides, and optionally CA125. 

Imaging agents carry labels to image the plurality of kallikrein polypeptides and CA125. Agents 
may be labelled for use in radionuclide imaging. In particular, agents may be directly or indirectly labelled 
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with a radioisotope. Examples of radioisotopes that may be used in the present invention are the following: 
277 Ac, 2U At, 128 Ba, I31 Ba, 7 Be, 204 Bi, 205 Bi, 206 Bi, 76 Br, 77 Br, 82 Br, I09 Cd, 47 Ca, n C, i4 C, 36 C1, 48 Cr, 51 Cr, 62 Cu, 
64 Cu, 67 Cu, 165 Dy, 155 Eu, I8 F, 153 Gd, 66 Ga, 67 Ga, 68 Ga, 72 Ga 5 19S Au, 3 H, I66 Ho, in In, I13m In, 115m In, 123 I, 125 I, 13I I, 

189j rj IPlm^ 192 Irj ,94^ 52^ 55^ 59^ 177^ 1 5q 191^-191^ 10^ 32^ 33p 3 42^ 226^ 186^ 188^ ,L R ^ 

5 153 Sm, 46 Sc, 47 Sc, 72 Se, 75 Se, 105 Ag, 22 Na, 24 Na, 89 Sr, 35 S, 38 S, 177 Ta, 96 Tc, 99m Tc, 201 T1, 202 T1 ? 113 Sn, 117m Sn, 
121 Sn, 166 Yb, 169 Yb, 175 Yb, 88 Y, 90 Y 3 62 Zn and 65 Zn. Preferably the radioisotope is 131 I, 125 I, 123 I 3 99m Tc, 
90 Y, 1S6 Re 5 188 Re, 32 P 5 I53 Sm 9 67 Ga 5 20I T1 77 Br, or 18 F, and it is imaged with a photoscanning device. 

Procedures for labeling biological agents with the radioactive isotopes are generally known in the 
art. U.S. Pat. No. 4,302,438 describes tritium labeling procedures. Procedures for iodinating, tritium labeling, 
10 and 35 S labeling especially adapted for murine monoclonal antibodies are described by Goding, J. W. (supra, 
pp 124-126) and the references cited therein. Other procedures for iodinating biological agents, such as 
antibodies, binding portions thereof, probes, or ligands, are described in the scientific literature (see Hunter 
and Greenwood, Nature 144:945 (1962), David et al., Biochemistry 13:1014-1021 (1974), and U.S. Pat. Nos. 
3,867,517 and 4,376,110). Iodinating procedures for agents are described by Greenwood, F. et al., Biochem. 

15 J. 89:114-123 (1963); Marchalonis, J., Biochem. J. 113:299-305 (1969); and Morrison, M. et al., 
Immunochemistry, 289-297 (1971). 99m Tc-labeling procedures are described by Rhodes, B. et al. in 
Burchiel, S. et al. (eds.), Tumor Imaging: The Radioimmunochemical Detection of Cancer, New York: 
Masson 111-123 (1982) and the references cited therein. Labelling of antibodies or fragments with 
technetium-99m are also described for example in U.S. Pat. No. 5,317,091, U.S. Pat. No. 4,478,815, U.S. 

20 Pat. No. 4,478,818, U.S. Pat. No. 4,472,371, U.S. Pat. No. Re 32,417, and U.S. Pat. No. 4,311,688. 
Procedures suitable for 111 In-labeling biological agents are described by Hnatowich, D. J. et al., J. Immul. 
Methods, 65:147-157 (1983), Hnatowich, D. et al., J. Applied Radiation, 35:554-557 (1984), and Buckley, 
R. G. et al., F.E.B.S. 166:202-204 (1984). 

An imaging agent may also be labeled with a paramagnetic isotope for purposes of an in vivo 

25 method of the invention. Examples of elements that are useful in magnetic resonance imaging include 
gadolinium, terbium, tin, iron, or isotopes thereof. (See, for example, Schaefer et al., (1989) JACC 14, 472- 
480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G L., (1984) Physiol. Chem. Phys. Med. 
NMR 16, 93-95; Wesbey et al., (1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al., (1984) 
Invest. Radiol. 19, 408-415 for discussions on in vivo nuclear magnetic resonance imaging.) 

30 In the case of radiolabeled agents, the agents may be administered to the patient, localized to the 

tumor having a plurality of kallikrein polypeptides, and optionally CA125, with which the agents bind, and 
detected or "imaged" in vivo using known techniques such as radionuclear scanning using, for example, a 
gamma camera or emission tomography. [See for example, A. R. Bradwell et al., "Developments in 
Antibody Imaging", Monoclonal Antibodies for Cancer Detection and Therapy, R. W. Baldwin et al., (eds.), 

35 pp. 65-85 (Academic Press 1985)]. A positron emission transaxial tomography scanner, such as designated 
Pet VI located at Brookhaven National Laboratory, can also be used where the radiolabel emits positrons 
(e.g., 11 C, 18 F, 15 O, and 13 N). 

Whole body imaging techniques using radioisotope labeled agents can be used for locating both 
primary tumors and tumors which have metastasized. Antibodies specific for a plurality of kallikrein 



WO 2004/075713 



PCT/CA2004/000281 



-25- 

polypeptides, and optionally CA125, or fragments thereof having the same epitope specificity, are bound to a 
suitable radioisotope, or a combination thereof, and administered parenterally. For ovarian cancer, 
administration preferably is intravenous. The bio-distribution of the labels can be monitored by scintigraphy, 
and accumulations of the labels can be related to the presence of ovarian cancer cells. Whole body imaging 
5 techniques are described in U.S. Pat. Nos. 4,036,945 and 4,311,688. Other examples of agents useful for 
diagnosis and therapeutic use that can be coupled to antibodies and antibody fragments include 
metallothionein and fragments (see, U.S. Pat. No. 4,732,864). These agents are useful in diagnosis, staging 
and visualization of cancer, in particular ovarian cancer, so that surgical and/or radiation treatment protocols 
can be used more efficiently. 

10 Screening Methods 

The invention also contemplates methods for evaluating test agents or compounds for their ability to 
inhibit ovarian cancer or potentially contribute to ovarian cancer. Test agents and compounds include but 
are not limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random 
peptide libraries and combinatorial chemistry- derived molecular libraries made of D- and/or L- configuration 

15 amino acids, phosphopeptides (including members of random or partially degenerate, directed 
phosphopeptide libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, 
single chain antibodies, fragments, (e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope- 
binding fragments thereof)], nucleic acids (e.g. antisense, interference RNA) and small organic or inorganic 
molecules. The agents or compounds may be endogenous physiological compounds or natural or synthetic 

20 compounds. 

The invention also provides a method for assessing the potential efficacy of a test agent for 
inhibiting ovarian cancer in a patient, the method comprising comparing: 

(a) levels of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in a first sample obtained from a patient and exposed to the test agent, 

25 wherein the markers comprise or are selected from the group consisting of kallikrein 5, 

kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 

(b) levels of the plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in a second sample obtained from the patient, wherein the sample is not 
exposed to the test agent, wherein a significant difference in the levels of expression of a 

30 plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same, 

in the first sample, relative to the second sample, is an indication that the test agent is 
potentially efficacious for inhibiting ovarian cancer in the patient. 
The first and second samples may be portions of a single sample obtained from a patient or portions 
of pooled samples obtained from a patient. 
35 In an aspect, the invention provides a method of selecting an agent for inhibiting ovarian cancer in a 

patient comprising: 

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
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encoding same, in each of the aliquots, wherein the markers comprise or are selected from 
the group consisting of kallikrein 5 S kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, 
and kallikrein 11; and 

(d) selecting one of the test agents which alters the levels of the kallikrein markers, optionally 
5 CA125, and/or polynucleotides encoding same, in the aliquot containing that test agent, 

relative to other test agents. 
Still another aspect of the present invention provides a method of conducting a drug discovery 
business comprising: 

(a) providing one or more methods or assay systems for identifying agents that inhibit ovarian 
1 0 cancer in a patient; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step 
(b) as having an acceptable therapeutic profile. 

15 In certain embodiments, the subject method can also include a step of establishing a distribution 

system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a 
sales group for marketing the pharmaceutical preparation. 

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test 
compound comprising: 

20 (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

compound; and 

(b) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in each of the aliquots, wherein the markers comprise or are selected from 
the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, 
25 and kallikrein 1 1 . 

A significant difference between the levels of the markers in the aliquot maintained in the presence 
of (or exposed to) the test compound relative to the aliquot maintained in the absence of the test compound, 
indicates that the test compound possesses ovarian carcinogenic potential. 
Kits 

30 The methods described herein may be performed by utilizing pre-packaged diagnostic kits 

comprising at least a plurality of kallikrein nucleic acids or binding agents (e.g. antibodies) or CA125 nucleic 
acids or binding agents described herein, which may be conveniently used, e.g., in clinical settings, to screen 
and diagnose patients, and to screen and identify those individuals afflicted with or exhibiting a 
predisposition to ovarian cancer. 

35 Thus, the invention also contemplates kits for carrying out the methods of the invention. Such kits 

typically comprise two or more components required for performing a diagnostic assay. Components include 
but are not limited to compounds, reagents, containers, and/or equipment. 

In an embodiment, a container with a kit comprises binding agents as described herein. By way of 
example, the kit may contain antibodies specific for a plurality of kallikrein polypeptides, and optionally 
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CA125, antibodies against the antibodies labelled with enzymes; and substrates for the enzymes. The kit may 
also contain microtiter plate wells, standards, assay diluent, wash buffer, adhesive plate covers, and/or 
instructions for carrying out a method of the invention using the kit. 

In an aspect of the invention, the kit includes antibodies or antibody fragments which bind 
5 specifically to epitopes of each of a plurality of kallikrein polypeptides, and optionally CA125, and means 
for detecting binding of the antibodies to epitopes associated with tumor cells, either as concentrates 
(including lyophilized compositions), which may be further diluted prior to use or at the concentration of 
use, where the vials may include one or more dosages. Where the kits are intended for in vivo use, single 
dosages may be provided in sterilized containers, having the desired amount and concentration of agents. 

10 Containers that provide a formulation for direct use, usually do not require other reagents, as for example, 
where the kit contains radiolabeled antibody preparations for in vivo imaging. 

A kit may be designed to detect the level of polynucleotides encoding kallikrein polypeptides, and 
optionally CA125 polynucleotides, in a sample. Such kits generally comprise oligonucleotide probes or 
primers, as described herein, that hybridize to a plurality of polynucleotides encoding kallikrein polypeptides 

15 and optionally CA125. Such oligonucleotides may be used, for example, within a PGR or hybridization 
procedure. Additional components that may be present within the kits include second oligonucleotides and/or 
diagnostic reagents to facilitate detection of a plurality polynucleotides encoding kallikrein polypeptides, and 
optionally CA125 polynucleotides. 

The reagents suitable for applying the screening methods of the invention to evaluate compounds 

20 may be packaged into convenient kits described herein providing the necessary materials packaged into 
suitable containers. 
Applications 

Kallikrein polypeptides (in particular, kallikrein 5, 6, 10 and 11), optionally in combination with 
CA125, are targets for ovarian cancer immunotherapy. Such immunotherapeutic methods include the use of 

25 antibody therapy, in vivo vaccines, and ex vivo immunotherapy approaches. 

In one aspect, the invention provides antibodies specific for a plurality of kallikrein polypeptides 
(for example, kallikreins 5, 6, 10 and 11) and optionally CA125, that may be used systemically to treat 
ovarian cancer. Preferably antibodies are used that target the tumor cells but not the surrounding non-tumor 
cells and tissue. Thus, the invention provides a method of treating a patient susceptible to, or having a cancer 

30 that expresses a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to the 
patient an effective amount of antibodies that bind specifically to a plurality of kallikrein polypeptides, and 
optionally CA125. In another aspect, the invention provides a method of inhibiting the growth of tumor cells 
expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to a 
patient antibodies which bind specifically to the plurality of kallikrein polypeptides, and optionally CA125, 

35 in amounts effective to inhibit growth of the tumor cells. Antibodies specific for a plurality of kallikrein 
polypeptides, and optionally CA125, may also be used in a method for selectively inhibiting the growth of, 
or killing a cell expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising reacting 
antibody immunoconjugates or immunotoxins with the cell in an amount sufficient to inhibit the growth of, 
or kill the cell. 
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By way of example, unconjugated antibodies specific for a plurality of kallikrein polypeptides, and 
optionally CA125, may be introduced into a patient such that the antibodies bind to cancer cells expressing a 
plurality of kallikrein polypeptides, and optionally CA125, and mediate growth inhibition of such cells 
(including the destruction thereof), and the tumor, by mechanisms which may include complement-mediated 
5 cytolysis, antibody- dependent cellular cytotoxicity, altering the physiologic function of a plurality of 
kallikrein polypeptides, and optionally CA125, and/or the inhibition of ligand binding or signal transduction 
pathways. In addition to unconjugated antibodies, antibodies specific for a plurality of kallikrein 
polypeptides, and optionally CA125, conjugated to therapeutic agents (e.g. immunoconjugates) may also be 
used therapeutically to deliver the agents directly to tumor cells expressing a plurality of kallikrein 

10 polypeptides, and optionally CA125, and thereby destroy the tumor. Examples of such agents include abrin, 
ricin A, Pseudomonas exotoxin, or diphtheria toxin, proteins such as tumor necrosis factor, alpha-interferon, 
beta-interferon, nerve growth factor, platelet derived growth factor, tissue plasminogen activator, and 
biological response modifiers such as lymphokines, interleukin-1, interleukin-2, interleukin-6, granulocyte 
macrophage colony stimulating factor, granulocyte colony stimulating factor, or other growth factors. 

I 5 Cancer immunotherapy using antibodies specific for a plurality of kallikrein polypeptides, and 

optionally CA125, may utilize the various approaches that have been successfully employed for cancers, 
including but not limited to colon cancer (Arlen et al., 1998, Crit Rev Immunol 18: 133-138), multiple 
myeloma (Ozaki et al., 1997, Blood 90: 3179-3186; Tsunenati et al., 1997, Blood 90: 2437-2444), gastric 
cancer (Kasprzyk et al., 1992, Cancer Res 52: 2771-2776), B-cell lymphoma (Funakoshi et al., 1996, J 

20 Immunther Emphasis Tumor Immunol 19: 93-101), leukemia (Zhong et al., 1996, Leuk Res 20: 581-589), 
colorectal cancer (Moun et al., 1994, Cancer Res 54: 6160-6166); Velders et al., 1995, Cancer Res 55: 4398- 
4403), and breast cancer (Shepard et al., 1991, J Clin Immunol 11: 1 17-127). 

In the practice of a method of the invention, antibodies specific for a plurality of kallikrein 
polypeptides, optionally in combination with antibodies specific for CA125, capable of inhibiting the growth 

25 of cancer cells expressing a plurality of kallikrein polypeptides, and optionally CA125, are administered in a 
therapeutically effective amount to cancer patients whose tumors express or overexpress a plurality of 
kallikrein polypeptides, and optionally CA125. The invention may provide a specific, effective and long- 
needed treatment for ovarian cancer. The antibody therapy methods of the invention may be combined with 
other therapies including chemotherapy and radiation. 

30 Patients may be evaluated for the presence and levels of a plurality of kallikrein polypeptides, and 

optionally CA125, expression and overexpression in tumors, preferably using immunohistochemical 
assessments of tumor tissue, quantitative imaging as described herein, or other techniques capable of reliably 
indicating the presence and degree of expression of a plurality of kallikrein polypeptides, and optionally 
CA125. Immunohistochemical analysis of tumor biopsies or surgical specimens may be employed for this 

35 purpose. 

Antibodies specific for a plurality of kallikrein polypeptides and CA125 useful in treating cancer 
include those that are capable of initiating a potent immune response against the tumor and those that are 
capable of direct cytotoxicity. In this regard, the antibodies may elicit tumor cell lysis by either complement- 
mediated or antibody-dependent cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc 
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portion of the immunoglobulin molecule for interaction with effector cell Fc receptor sites or complement 
proteins. In addition, antibodies specific for a plurality of kallikrein polypeptides and CA125 that exert a 
direct biological effect on tumor growth are useful in the practice of the invention. Such antibodies may not 
require the complete immunoglobulin to exert the effect. Potential mechanisms by which such directly 
5 cytotoxic antibodies may act include inhibition of cell growth, modulation of cellular differentiation, 
modulation of tumor angiogenesis factor profiles, and the induction of apoptosis. The mechanism by which a 
particular antibody exerts an anti-tumor effect may be evaluated using any number of in vitro assays 
designed to determine ADCC, antibody- dependent macrophage-mediated cytotoxicity (ADMMC), 
complement-mediated cell lysis, and others known in the art. 

10 The anti-tumor activity of a combination of antibodies specific for a plurality of kallikrein 

polypeptides and optionally CA125, may be evaluated in vivo using a suitable animal model. Xenogenic 
cancer models, wherein human cancer explants or passaged xenograft tissues are introduced into immune 
compromised animals, such as nude or SCID mice, may be employed. 

The methods of the invention contemplate the administration of combinations, or "cocktails" of 

15 different individual antibodies recognizing epitopes of a plurality of kallikrein polypeptides, and optionally' 
CA125. Such cocktails may have certain advantages inasmuch as they contain antibodies that bind to 
different epitopes and/or exploit different effector mechanisms or combine directly cytotoxic antibodies with 
antibodies that rely on immune effector functionality. Such antibodies in combination may exhibit 
synergistic therapeutic effects. In addition, the administration of the antibodies may be combined with other 

20 therapeutic agents, including but not limited to chemotherapeutic agents, androgen-blockers, and immune 
modulators (e.g., IL2, GM-CSF). The antibodies may be administered in their "naked" or unconjugated form, 
or may have therapeutic agents conjugated to them. 

The antibodies specific for a plurality of kallikrein polypeptides and optionally CA125, used in the 
practice of the method of the invention may be formulated into pharmaceutical compositions comprising a 

25 carrier suitable for the desired delivery method. Suitable carriers include any material which when combined 
with the antibodies retains the anti-tumor function of the antibodies and is non-reactive with the subject's 
immune systems. Examples include any of a number of standard pharmaceutical carriers such as sterile 
phosphate buffered saline solutions, bacteriostatic water, and the like (see, generally, Remington's 
Pharmaceutical Sciences 16.sup.th Edition, A. Osal., Ed., 1980). 

30 Antibody formulations may be administered via any route capable of delivering the antibodies to 

the tumor site. Routes of administration include, but are not limited to, intravenous, intraperitoneal, 
intramuscular, intratumor, intradermal, and the like. Preferably, the route of administration is by intravenous 
injection. Antibody preparations may be lyophilized and stored as a sterile powder, preferably under 
vacuum, and then reconstituted in bacteriostatic water containing, for example, benzyl alcohol preservative, 

35 or in sterile water prior to injection. 

Treatment will generally involve the repeated administration of the antibody preparation via an 
acceptable route of administration such as intravenous injection (IV), at an effective dose. Dosages will 
depend upon various factors generally appreciated by those of skill in the art, including the type of cancer 
and the severity, grade, or stage of the cancer, the binding affinity and half life of the antibodies used, the 
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degree of expression of a plurality of kallikrein polypeptides, and optionally CA125, in the patient, the extent 
of circulating kallikrein polypeptide antigens, and optionally CA125 antigens, the desired steady-state 
antibody concentration level, frequency of treatment, and the influence of any chemotherapeutic agents used 
in combination with a treatment method of the invention. 
5 Daily doses may range from about 0.1 to 100 mg/kg. Doses in the range of 10-500 mg antibodies 

per week may be effective and well tolerated, although even higher weekly doses may be appropriate and/or 
well tolerated. A determining factor in defining the appropriate dose is the amount of antibodies necessary to 
be therapeutically effective in a particular context. Repeated administrations may be required to achieve 
tumor inhibition or regression. Direct administration of antibodies specific for a plurality of kallikrein 

10 polypeptides and optionally CA125 is also possible and may have advantages in certain situations. 

Patients may be evaluated for a plurality of kallikrein polypeptides and optionally CA125, 
preferably in serum, in order to assist in the determination of the most effective dosing regimen and related 
factors. The assay methods described herein, or similar assays, may be used for quantitating circulating 
kallikrein polypeptide and optionally CA125 levels in patients prior to treatment. Such assays may also be 

15 used for monitoring throughout therapy, and may be useful to gauge therapeutic success in combination with 
evaluating other parameters, such as serum kallikrein polypeptides, and optionally CA125, levels. 

The invention further provides vaccines formulated to contain a plurality of kallikrein polypeptides, 
and optionally CA125, or fragments thereof. The use in anti- cancer therapy of tumor antigens in a vaccine 
for generating humoral and cell-mediated immunity is well known and, for example, has been employed in 

20 prostate cancer using human PSMA and rodent PAP immunogens (Hodge et al, 1995, Int. J. Cancer 63: 231- 
237; Fong et al, 1997, J. Immunol. 159: 3113-3117). These methods can be practiced by employing a 
plurality of kallikrein polypeptides, and optionally CA125, or fragments thereof, or nucleic acids and 
recombinant vectors capable of expressing and appropriately presenting the kallikrein and optionally CA125, 
immunogens. 

25 By way of example, viral gene delivery systems may be used to deliver nucleic acids encoding a 

plurality of kallikrein polypeptides, and optionally CA125. Various viral gene delivery systems which can be 
used in the practice of this aspect of the invention include, but are not limited to, vaccinia, fowlpox, 
canarypox, adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus, and sindbus virus (Restifo, 
1996, Curr. Opin. Immunol. 8: 658-663). Non-viral delivery systems may also be employed by using naked 

30 DNA encoding a plurality of kallikrein polypeptides, and optionally CA125, or fragments thereof introduced 
into the patient (e.g., intramuscularly) to induce an anti-tumor response. 

Various ex vivo strategies may also be employed. One approach involves the use of cells to present 
kallikrein and optionally CA125 antigens to a patient's immune system. For example, autologous dendritic 
cells which express MHC class I and II, may be pulsed with a plurality of kallikrein polypeptides, and 

35 optionally CA125, or peptides thereof that are capable of binding to MHC molecules, to thereby stimulate 
ovarian cancer patients* immune systems (See, for example, Tjoa et al., 1996, Prostate 28: 65-69; Murphy et 
al., 1996, Prostate 29: 371-380). 

Anti-idiotypic antibodies specific for a plurality of kallikrein polypeptides, and optionally CA125, 
can also be used in anti-cancer therapy as a vaccine for inducing an immune response to cells expressing the 
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polypeptides. The generation of anti-idiotypic antibodies is well known in the art and can readily be adapted 
to generate anti-idiotypic antibodies that mimic an epitope on a kallikrein polypeptide or CA125 (see, for 
example, Wagner et al., 1997, Hybridoma 16: 33-40; Foon et al., 1995, J Clin Invest 96: 334-342; Herlyn et 
aL, 1996, Cancer Immunol Immunother 43: 65-76). Such antibodies can be used in anti-idiotypic therapy as 
5 presently practiced with other anti-idiotypic antibodies directed against tumor antigens. 

Genetic immunization methods may be utilized to generate prophylactic or therapeutic humoral and 
cellular immune responses directed against cancer cells expressing a plurality of kallikrein polypeptides, and 
optionally CA125. Constructs comprising DNA encoding kallikrein and optionally CA125 
polypeptides/immunogens and appropriate regulatory sequences may be injected directly into muscle or skin 

10 of an individual, such that the cells of the muscle or skin take-up the construct and express the encoded 
kallikrein or CA125 polypeptides/immunogens. The polypeptides/immunogens may be expressed as cell 
surface proteins or be secreted. Expression of the polypeptides/immunogens results in the generation of 
prophylactic or therapeutic humoral and cellular immunity against the cancer. Various prophylactic and 
therapeutic genetic immunization techniques known in the art may be used. 

15 The invention further provides methods for inhibiting cellular activity (e.g., cell proliferation, 

activation, or propagation) of a cell expressing a plurality of kallikrein polypeptides, and optionally CA125. 
This method comprises reacting immunoconjugates of the invention (e.g., a heterogeneous or homogenous 
mixture) with the cell so that the kallikrein polypeptides, and optionally CA125, form complexes with the 
immunoconjugates. A subject with a neoplastic or preneoplastic condition can be treated when the inhibition 

20 of cellular activity results in cell death. 

In another aspect, the invention provides methods for selectively inhibiting a cell expressing a 
plurality of kallikrein polypeptides, and optionally CA125, by reacting a combination of immunoconjugates 
of the invention with the cell in an amount sufficient to inhibit the cell. Amounts include those that are 
sufficient to kill the cell or sufficient to inhibit cell growth or proliferation. 

25 Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial 

plasmids, may be used to deliver nucleic acids encoding a plurality of kallikrein polypeptides, and optionally 
CA125, to a targeted organ, tissue, or cell population. Methods well known to those skilled in the art may be 
used to construct recombinant vectors that will express antisense nucleic acid molecules for kallikrein 
polypeptides and CA125. (See, for example, the techniques described in Sambrook et al (supra) and Ausubel 

30 et al (supra)). 

Genes encoding a plurality of kallikrein polypeptides, and optionally CA125, can be turned off by 
transfecting a cell or tissue with vectors that express high levels of a desired kallikrein or CA125 
polypeptide-encoding fragments. Such constructs can inundate cells with untranslatable sense or antisense 
sequences. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
35 molecules until all copies are disabled by endogenous nucleases. 

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or 
PNA, to the regulatory regions of genes encoding kallikrein polypeptides, and optionally CA125, i.e., the 
promoters, enhancers, and introns. Preferably, oligonucleotides are derived from the transcription initiation 
site, eg, between -10 and +10 regions of the leader sequence. The antisense molecules may also be designed 
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so that they block translation of mRNA by preventing the transcript from binding to ribosomes. Inhibition 
may also be achieved using "triple helix" base-pairing methodology. Triple helix pairing compromises the 
ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or 
regulatory molecules. Therapeutic advances using triplex DNA were reviewed by Gee J E et al (In: Huber B 
5 E and B I Can' (1994) Molecular and Immunologic Approaches, Futura Publishing Co, Mt Kisco N.Y.). 

Ribozymes are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes 
act b}' sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. The invention therefore contemplates engineered hammerhead motif ribozyme 
molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding a 

1 0 plurality of kallikrein polypeptides, and optionally CA1 25. 

Specific ribozyme cleavage sites within any potential RNA target may initially be identified by 
scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, 
GUU and GUC. Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary 

15 structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may 
also be determined by testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. 

Methods for introducing vectors into cells or tissues include those methods discussed herein and 
which are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced 

20 into stem cells obtained from a patient and clonally propagated for autologous transplant into the same 
patient (See U.S. Pat. Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well 
known in the art. 

Kallikrein polypeptides, optionally CA125 polypeptide, and/or polynucleotides encoding the 
polypeptides, and fragments thereof, antibodies and/or agents identified using a method of the invention, or 

25 combinations thereof, may be used in the treatment of ovarian cancer or diseases, conditions or syndromes 
associated with ovarian cancer, in a subject. A combination of kallikrein polypeptides and/or 
polynucleotides encoding the kallikreins (e.g. kallikreins 7 and 8) and inhibitors (antisense, antibodies, or 
agents) of other kallikreins (e.g. kallikreins 5, 6, 10 and 11) and/or CA125 may be used in a prognostic or 
therapeutic method of the invention. The polypeptides, polynucleotides, and agents may be formulated into 

30 compositions for administration to subjects suffering from ovarian cancer. Therefore, the present invention 
also relates to a composition comprising a plurality of kallikrein polypeptides and optionally CA125, or 
nucleic acids encoding the polypeptides, or a fragment thereof, or an agent identified using a method of the 
invention, and a pharmaceutically acceptable carrier, excipient or diluent. A method for treating or 
preventing ovarian cancer in a subject is also provided comprising administering to a patient in need thereof, 

35 a plurality of kallikrein polypeptides and optionally CA125, or nucleic acids encoding the polypeptides, an 
agent identified in accordance with a method of the invention, and/or a composition of the invention. 

The active substance may be administered in a convenient manner such as by injection 
(subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal 
administration. Depending on the route of administration, the active substance ma}' be coated in a material to 
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protect the substance from the action of enzymes, acids and other natural conditions that may inactivate the 
substance. 

The compositions described herein can be prepared by per se known methods for the preparation of 
pharmaceutically acceptable compositions which can be administered to subjects, such that an effective 
5 quantity of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle. 
Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the 
compositions include, albeit not exclusively, solutions of the active substances in association with one or 
more pharmaceutically acceptable vehicles or diluents, and contained in buffered solutions with a suitable 
10 pH and iso-osmotic with the physiological fluids. 

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment (e.g. chemotherapy or radiotherapy). The compositions of the 
invention may be administered concurrently, separately, or sequentially with other therapeutic agents or 
therapies. 

15 The following non-limiting examples are illustrative of the present invention: 

Example 1 

To investigate the additional discriminatory value of the kallikreins to CA125 a logistic regression 
model was developed. Included in the study were serum samples from 39 ovarian cancer patients and 194 
non-cancer females. The age of the patients was as follows: median = 59, range 32-82. The age of the 

20 controls was as follows: median = 46; range = 22-77. The model was adjusted for the following variables: 
f(x) = -0.29 hK5* +0.12* hK6-0.65*hK7-0.6*hK84-1.09*hK10+0.98*hKll+0.057*CA125-0.62. For these 
data, the crude odds ratio and the 95% confidence interval were found to be 2.71 and 1.91-3.84 (p<0.001). 
The log likelihood scores for this multivariate logistic regression model, which incorporated the combined 
variables for each patient was calculated. From these data, by picking different thresholds for the regression 

25 function values, a ROC curve was devised which shows the added value of using kallikreins and CA125 
together in a multivariate function.( AUC, 0.99;95%CI,0.96-L00). (See Figure 8.) Statistically significant 
correlations between age and other studied variables were not observed. 
Example 2 

Statistically significant differences in serum kallikrein concentration was found between patient and 
30 control subjects for kallikreins hK5 (p<0.0001), hK7 (p=0.007), hK8 (p=0.005), hK10 (p=0.0003) and 
CA125 (p<0.0001) by the Maim- Whitney test. The diagnostic sensitivity (SENS) and specificity (SPEC) for 
each one of these markers were as follows (SENS/SPEC; both as %): 31/95 (hK5); 62/71 (hK7); 62/70 
(hK10); 54/54 (hKll); 89/94 (CA125). When these data were combined in a logistic regression model, 
kallikreins 5 and 10 did not contribute to a great extent to the sensitivity and specificity of CA125. The area 
35 under the curve of CA125 alone (93%) improved by a further 1% when adding hK6, by 2% when adding 
hKll, 3% when adding hK7 and 5% when adding hK8. The combination of CA125 and hK8 resulted in an 
AUC of 98%. 

Below is a summary of each marker and its ability to separate the cases and controls. 
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hK5: high values associated with cancer 
test* is hK5>0.10, test- is hK5<=0.10 
sensitivity=31%, specificity=95%, 
AUC=62, p(AUC)=.02 
5 Wilcoxon rank sum test has p<. 000 1 . 

Of the 233 persons analyzed, 207 have value zero for hK5 (27 cases, 180 controls). 
Possible good marker 

hK6: high values associated with cancer 
10 testf is hK6>6.3, test- is hK6<=6.3 
sensitivity=69%, specificity=40%, 
AUC=.50, p(AUC)=1.00 
Wilcoxon rank sum test has p= 91. 
Not a good marker 

15 

hK7: low values associated with cancer 
test* is hK7<2.05, test- is hK7>=2.05 
sensitivity=62%, specificity=71%, 
AUC=.64, p(AUC)=.006 
20 Wilcoxon rank sum test has p= 007. 
Possible good marker 

hK8: low values associated with cancer 
tesH- is hK8<13.0, test- is hK8>=13.0 
25 sensitivity=72%, specificity=42%, 
AUC-.64, p(AUC)-.006 
Wilcoxon rank sum test has p= 005 
Possible good marker 

30 hKlO: high values associated with cancer 

test+ is hK10>1.42, test- is hK10<=1.42 

sensitivity=62%, specificity=70%, 

AUC=.68, p(AUC)=.0004 

Wilcoxon rank sum test has p=.0003. 
35 Best single kallikrein marker 

hKl 1: high values associated with cancer 
test-l- is hKl 1>0. 14, test- is hKl 1<=0. 14 
sensitivity=54%, specificity=54%, 
40 AUC=.58, p(AUC)= 12 

Wilcoxon rank sum test has p=. 1 1 . 
Not a good marker 

CA125: high values associated with cancer 
45 testf is Cal25>34, test- is Cal25<=34 
sensitivity=89%, specificity= : 94%, 
AUC=.933, p(AUC)=<.0001 
Wilcoxon rank sum test has p<.0001. 
Good marker 

50 

After some further multivariate analysis of only the kallikrein markers, the combination of hK 7, 8, 10 and 
1 1 was a preferred set. This combination was arrived at by looking at the incremental AUG as markers were 
combined. Below is a summary of all the models tried: 



55 hK10 alone, AUC=68 
hK10+hK7: AUO.88 
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hK10+hK7+hK8: AUO.90 
hK10+hK7+liK8+hKll: AUC=925 

Multivariate model of hK7, hK8, hK10, hKl 1, call it hK7_8_10_l 1 

5 hK7_8_ll<Ml: 

Calculate SA=2.00- 1.49(hK7>.34(hK8)+l. 1 6(hKl 0)+3.50(hKl 1) 
high values associated with cancer 
testf- is SA>-1.15, test- is SA<=-1.15 
sensitivity=87%, specificity=89%, 
10 AUC=93,p(AUC)=<.0001 

Wilcoxon rank sum test has p<.0001. 
Good marker 

The hK marker that added the most to CA125 was also investigated. 
15 CA125 alone, AUC-.933 
CAl25+hK8: AUO.978 

Multivariate model of Cal25, hK8, call it Cal25_hK8 



Cal25_hK8: 
20 SC=-1.71-h086(Cal25)~.47(hK8). 

high values associated with cancer 

test+ is SO-2.52, test- is SC<-2.52 

sensitivity=97%, specificity=90%, 

AUC- 978, p(AUC)=<.0001 
25 Wilcoxon rank sum test has p<000 1 . 

Good marker 

Below is a summary of the above analyses: 

a) The preferred kallikrien marker alone is hK10, AUO=.68 

30 b) CA125 has an AUG of .933 

c) The preferred combination of kallikrein markers increases the AUC up to .925, which is close to 
CA125 AUC of .933 

d) Adding a kallikrein marker to CA125 can increase the AUC up to .978 



35 How does CA125 alone compare with the multivariate kallikrein model hK7_8_10_ll? 
(based on 39 cases and 186 controls evaluated with CA125) 



Sensitivity Specificity misclassification 

CA125 90% 94% 12FP, 4FN, total 16 pts misclassified 

40 hK7_8_10_l 1 85% 89% 3 1FP, 4FN, total 35 pts misclassified 

both positive 77% 100% 0FP, 9 FN, total 9 pts misclassified 

either positive 97% 82% 33FP, 1FN, total 34 pts misclassified 



How does CA125 alone compare with the multivariate model of CA125 plus hK8? 
45 (based on 39 cases and 186 controls evaluated with CA125) 

Sensitivity Specificity misclassification 

CA125 90% 94% 12FP, 4FN, total 1 6 pts misclassified 

CA125JiK8 95% 91% 17FP, 2 FN, total 19 pts misclassified 
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Kallikrein markers approach CA125 in terms of AUG and sensitivity, but their specificity is not as 
high. Adding hK8 to CA125 improves sensitivity but specificity is lower than CA125 alone. 
Summary 

5 a) The best kallikrein marker alone is hKl 0, area under the ROC curve (AUG) —.68. 

b) CA125 has an AUG of .933. Adding a single kallikrein marker to CA125 can get the AUG 
up to .978. Adding hK8 to CA125 improves sensitivity but specificity is lower compared 
with CA125 alone. 

c) The best combination of kallikrein markers gets the AUG up to .925, which is close to the 
10 CA125 AUG of .933. Kallikrein markers approach CA125 in terms of AUG and 

sensitivity, but their specificity is lower. 



The present invention is not to be limited in scope by the specific embodiments described herein, 

15 since such embodiments are intended as but single illustrations of one aspect of the invention and any 
functionally equivalent embodiments are within the scope of this invention. Indeed, various modifications of 
the invention in addition to those shown and described herein will become apparent to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to fall within 
the scope of the appended claims. 

20 All publications, patents and patent applications referred to herein are incorporated by reference in 

their entirety to the same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference in its entirety. All publications, 
patents and patent applications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the domains, cell lines, vectors, methodologies etc. which are reported therein 

25 which might be used in connection with the invention. Nothing herein is to be construed as an admission that 
the invention is not entitled to antedate such disclosure by virtue of prior invention. 

It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and 
"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a 
host cell" includes a plurality of such host cells, reference to the "antibody" is a reference to one or more 

30 antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Below full citations are set out for the references referred to in the specification. 
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Table 1 



Kallikrein Polypeptide 


Kallikrein Nucleic Acid 
Designation 


GenBank Accession No. 


Kallikrein 5 


KLK5 


AAD26429, AF135028, AF168768 


Kallikrein 6 


KLK6 


AAB664S3, AF0139S8(CDS 174.. 881), 
AF149289 (CDS join 3567.3606, 4346.. 4502, 
8122..8369, 9791.-9927,1 1805..11957) U62801 
(CDS 246..980) 


Kallikrein 7 


KLK7 


AAC37551, L33404 (CDS 16..777), AF166330 
(CDS join 3237..3309, 3722.3869, 4566..4813, 
5129..5265, 7362..7517) (mRNA join(1756..1785, 
3179..3309, 3722.3869, 4566..4813, 5129..5265, 
7362.. 8265) /products "stratum corneum 
chymotryptic enzyme" /note- 'alternatively 
spliced" ; mRNA join (1756.. 1785, 3179.3309, 
3722.3869, 4566..4813, 5129..5265, 7362..7991) 
/note="alternatively spliced"; mRNA join 
(1821.. 1864, 3179.3309, 3722.3869, 4566..4813, 
5129..5265, 7362.. 8265) /product="stratum 
corneum chymotryptic enzyme" 
/note="alternatively spliced"; mRNA join 
(1821. .1864, 3179.3309, 3722.3869, 4566..4813, 
5129..5265, 7362.. 7991) /note="alternatively 
spliced" 


Kallikrein 8 


KLK8 


BAA28673, AB009849 (CDS 35..817), 
AF095743 (CDS join 1035.. 1104, 16 19.. 1778, 
1944..2206, 4304..4437, 5974..6129, mRNA 
500..670, 1027.. 1104, 1619..1778, 1944..2206, 
4304..4437, 5974.. 6 174), AB010780 (CDS join 
1.39, 418. .712, 878..>946), AF055982 


Kallikrein 10 


KLK10 


AAC14266, AF055481 (CDS join 6 14.. 701, 
2455..2635, 3589.3863, 4195..4328, 4793..4945, 
mRNA join 48.. 120, 605..701, 2455..2635, 
3589.3863, 4195..4328, 4793..5474), 
NM_002776 (CDS 220.. 1050) 


Kallikrein 11 


KLK11 


BAA33404, AAD47815, AB012917 (CDS 
26.. 874), AF164623 (CDS 4224..4263, 
5061..5217, 5545..5810, 6627..6763, 7158..7310) 
(mRNA join (2313..2398, 4189..4263, 
5061..5217,5545..5810, 6627..6763,7158..7622) 
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Table 2 

Descriptive statistics for hk5, hk6 3 hk7, hk8, hklO and hkll serum protein levels in controls and 

patients with ovarian cancer 



hk5 dig/ ml) 
Non cancer (N=194) 
Cancer (N=3 9) 
% Increase** 



Mean 

0.063 
0.48 
661% 



Standard Error 

0.029 
0.18 



Median Range p value* 

0.00 0.00-4.50 
0.00 0.00-5.70 

<0.001 



hk6(ng/mn 

Non cancer (N=194) 
Cancer (N=39) 
% Increase** * 



6.96 
9.88 
42% 



0.18 
2.20 



6.60 
6.60 



1.60-15.30 
1.50-70.80 



0.91 



hk7(ng/ml) 

Non cancer (N=194) 
Cancer (N-39) 
% Decrease** 



2.60 
2.49 
4% 



0.071 
0.41 



2.67 
1.80 
33% 



0.30-6.00 
0.00-10.80 



0.007 



hk8(ng/mn 

Non cancer (N=194) 
Cancer (N=39) 
% Decrease** 



11.74 
11.91 



0.27 
1.88 



11.70 
6.90 
41% 



2.40-22.20 
0.00-46.20 



0.005 



hK10(ng/mr> 

Non cancer (N=194) 1.16 

Cancer (N=3 9) 6.51 

% Increase** 461% 



0.051 
2.46 



1.08 
1.59 
40% 



0.00-4.20 
0.27-90.0 



O.001 



hKll(ng/mn 

Non cancer (N=194) 0.21 

Cancer (N=39) 0.79 

% Increase** 276% 



0.018 
0.21 



0.12 
0.18 
50% 



00-1.30 
0.00-5.52 



0.011 



10 



* Calculated by the Mann Whitney test 

** Calculated by assuming that value in non-cancerouos tissue is 100% 
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Table 3 

Correlations between the studied variables in 194 non-cancer cases 



variable 



hK5 



hK6 



hK7 



hK8 



hK10 hKll CA125 



hK5 



hK6 



hK7 



hK8 



hK10 



hKll 



CA125 



1.000 

0.034 
0.642 
-0.053 
0.462 
0.066 
0.359 
0.134 
0.062 
0.150 
0.037 
0.101 
0.172 



0.034 



1.000 



0.114 



0.115 



-0.053 0.066 



0.642 0.462 



0.114 



0.115 



1.000 



-0.160 0.135 



0.134 



0.298 0.191 



0.000 0.000 



0.191 0.321 0.263 



0.008 0.000 0.000 



0.120 0.399 0.396 



0.097 0.000 0.000 



0.176 



0.014 



0.048 



0.150 0.101 



0.359 0.062 0.037 0.172 



0.120 



0.000 0.000 



1.000 0.176 



1.000 



0.035 0.125 



-0.160 



0.000 0.008 0.097 0.029 



0.497 0.321 0.399 0.135 



0.000 0.000 0.000 0.066 



0.298 0.497 1.000 0.263 0.396 0.048 



0.519 



0.035 



0.014 0.638 



0.125 



0.090 



1.000 



0.029 



0.066 0.519 



0.638 



0.090 
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Table 4 

Correlations between the studied variables in 39 ovarian cancer cases 



variable 



hK5 



hK6 



hK7 



hKS 



hK10 



hKll 



CA125 



hK5 



1.000 

0.475 
0.002 
0.553 
0.000 
0.554 
0.000 
0.618 
0.000 
0.584 
0.000 
0.507 
0.001 



hK6 



0.002 



0.327 



0.042 



0.661 



hK7 



1.000 0.327 



0.001 0.000 



0.003 0.000 



hXCS 



hlClO hKll CA125 



0.475 0.553 0.554 0.618 



0.000 0.000 



0.513 



0.000 
0.470 0.661 



0.000 



0.000 



0.000 0.000 0.000 0.000 



0.530 0.262 0.443 0.548 



0.584 0.507 
0.000 0.001 



0.042 0.001 0.003 0.000 



1.000 0.695 0.690 0.748 



0.470 0.690 0.602 1.000 0.706 



0.748 0.783 0.706 1.000 



0.556 



0.530 



0.001 



0.262 



0.000 0.000 0.000 0.107 



0.513 0.695 1.000 0.602 0.783 0.443 



0.000 0.005 



0.548 



0.000 0.000 



0.556 
0.000 
1.000 



0.001 0.107 



0.005 



0.000 0.000 
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We Claim ; 

1. A method for detecting a plurality of kallikrein markers associated with ovarian cancer in a patient 

comprising: 

5 (a) obtaining a sample from a patient; 

(b) detecting in the sample a plurality of kallikrein markers and optionally CA125, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(c) comparing the detected amounts with amounts detected for a standard. 

10 2. A method for diagnosing and monitoring ovarian cancer in a subject comprising detecting in a 

sample from the subject a plurality of kallikrein markers, wherein the kallikrein markers comprise 
or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein. 

3. A method as claimed in claim 1 or 2 wherein the plurality of kallikrein markers are detected using 
15 antibodies that bind to each of the plurality of kallikrein markers or parts thereof 

4. A method as claimed in claim 1, 2, 3 which further comprises detecting CA125. 

5. A method of detecting ovarian cancer in a patient, the method comprising comparing: 

(a) levels of a plurality of kallikrein markers, and optionally CA125, in a sample from the 
patient, wherein the kallikrein markers comprise or are selected from the group consisting 

20 of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(b) normal levels of expression of the plurality of kallikrein markers, and optionally CA125, in 
a control sample, wherein a significant difference in levels of kallikrein markers and 
optionally CA125, relative to the corresponding normal levels, is indicative of ovarian 
cancer. 

25 6. A method for monitoring the progression of ovarian cancer in a patient, the method comprising: (a) 

detecting in a sample from the patient at a first time point, a plurality of kallikrein markers, wherein 
the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; (b) repeating step (a) at a subsequent 
point in time; and (c) comparing levels detected in steps (a) and (b), and thereby monitoring the 

30 progression of ovarian cancer. 

7. A method for determining in a patient whether ovarian cancer has metastasized or is likely to 
metastasize in the future, the method comprising comparing (a) levels of a plurality of kallikrein 
markers, and optionally CA125, in a patient sample, wherein the kallikrein markers comprise or are 
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 

35 10, and kallikrein 11; and (b) normal levels or non-metastatic levels of the kallikrein markers and 

optionally CA125, in a control sample wherein a significant difference between the levels of 
expression in the patient sample and the normal levels or non-metastatic levels is an indication that 
the ovarian cancer has metastasized. 

8. A method for assessing the aggressiveness or indolence of ovarian cancer comprising comparing: 
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(a) levels of expression of a plurality of kallikrein markers, and optionally CA125, in a patient 
sample, wherein the kallikrein markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (b) normal 
levels of expression of the plurality of markers and optionally CA125, in a control sample, wherein 
5 a significant difference between the levels in the patient sample and normal levels is an indication 

that the cancer is aggressive or indolent. 

9. A method for diagnosing and monitoring ovarian cancer in a sample from a subject comprising 
isolating nucleic acids from the sample; and detecting in the sample polynucleotides encoding a 
plurality of kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or 

10 are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 

kallikrein 10, and kallikrein. 

10. A method as claimed in claim 9 wherein significant differences in the levels of the polynucleotides 
in the sample compared to a control is indicative of disease, disease stage, and/or prognosis. 

11. A method for determining the presence or absence of ovarian cancer in a subject comprising: (a) 
15 contacting a sample obtained from the subject with oligonucleotides that hybridize to 

polynucleotides encoding kallikrein markers, and optionally CA125, wherein the kallikrein markers 
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11; and (b) detecting in the sample a level of nucleic acids 
in the sample that hybridize to the polynucleotides relative to a predetermined cut-off value, and 
20 therefrom determining the presence or absence of ovarian cancer in the subject. 

12. A method as claimed in claim 11, wherein the nucleic acids are mRNA and the levels of nucleic 
acids are detected by polymerase chain reaction. 

13. A method as claimed in claim 1 1 wherein the nucleic acids are mRNA and the amounts of mRNA 
are detected using a hybridization technique, employing oligonucleotide probes that hybridize to 

25 kallikrein markers, and optionally CA125. 

14. A method for assessing the potential efficacy of a test agent for inhibiting ovarian cancer in a 
patient, the method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally 
CA125, and/or polynucleotides encoding same, in a first sample obtained from a patient and 
exposed to the test agent, wherein the kallikrein markers comprise or are selected from the group 

30 consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 

(b) levels of the plurality of kallikrein markers, optionally CA125, and/ or polynucleotides encoding 
same, in a second sample obtained from the patient, wherein the sample is not exposed to the test 
agent, wherein a significant difference in the levels of expression of the plurality of kallikrein 
markers, optionally CA125, and/or polynucleotides encoding same, in the first sample, relative to 

35 the second sample, is an indication that the test agent is potentially efficacious for inhibiting ovarian 

cancer in the patient. 

15. A method of claim 14 wherein the first and second samples are portions of a single sample obtained 
from the patient. 

16. A method of claim 14 wherein the first and second samples are portions of pooled samples obtained 
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from the patient. 

17. A method of assessing the efficacy of a therapy for inhibiting ovarian cancer in a patient, the 
method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally CA125, 
and/or polynucleotides encoding same, in a first sample obtained from the patient, wherein the 

5 kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 

kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and (b) levels of the kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, in a second sample obtained from the 
patient following therapy, wherein a significant difference in the levels of expression of the 
kallikrein markers, optionally CA125, and/or polynucleotides encoding same, in the second sample, 
10 relative to the first sample, is an indication that the therapy is efficacious for inhibiting ovarian 

cancer in the patient. 

18. A method of selecting an agent for inhibiting ovarian cancer in a patient the method comprising (a) 
obtaining a sample comprising cancer cells from the patient; (b) separately exposing aliquots of the 
sample in the presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein 

15 markers, optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein 

the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) selecting one of the test agents 
which alters the levels of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same, in the aliquot containing that test agent, relative to other test agents. 

20 19. A method of inhibiting ovarian cancer in a patient, the method comprising (a) obtaining a sample 
comprising cancer cells from the patient; (b) separately maintaining aliquots of the sample in the 
presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 

25 kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) administering to the patient at 

least one of the test agents which alters the levels of kallikrein markers, optionally CA125, and/or 
polynucleotides encoding same, in the aliquot containing that test agent, relative to other test agents. 

20. A method of assessing the ovarian cell carcinogenic potential of a test compound, the method 
comprising: (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

30 compound; and (b) comparing expression of a plurality of markers, optionally CA125, and/or 

polynucleotides encoding same, in each of the aliquots, wherein the markers comprise or are 
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 
10, and kallikrein 1 1, and wherein a significant difference in levels of kallikrein markers, optionally 
CA125, and/or polynucleotides encoding same, in the aliquot maintained in the presence of the test 

35 compound, relative to the aliquot maintained in the absence of the test compound, is an indication 

that the test compound possesses ovarian cell carcinogenic potential. 

21. A method of inhibiting ovarian cancer in a patient at risk for developing ovarian cancer, the method 
comprising inhibiting expression of genes encoding kallikrein markers and optionally CA125, 
wherein the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
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kallikrein 6, kallikrein 7, kallikrein 8 5 kallikrein 10, and kallikrein 11. 

22. A method of any preceding claim wherein the plurality comprises at least three of the markers. 

23. A method of any preceding claim wherein the plurality comprises at least five of the markers. 

24. A method of any preceding claim wherein the plurality of kallikrein markers is selected from the 
5 group consisting of kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and 

kallikrein 10; kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and 
kallikrein 10; kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; kallikrein 5, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11; or kallilkrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10 and kallikrein 11. 
10 25. A method of any proceeding claims wherein the kallikrein markers are kallikrein 7, kallikrein 8, 
kallikrein 10 and kallirkein 11. 

26. A method of any preceding claim wherein the patient sample comprises serum obtained from the 
patient. 

27. A kit for carrying out a method as claimed in any preceding claim. 

15 28. A kit for assessing whether a patient is afflicted with ovarian cancer, the kit comprising reagents 
that specifically bind with a plurality of kallikrein markers and optionally CA125, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. 

29. A kit for assessing the suitability of each of a plurality of agents for inhibiting ovarian cancer in a 
20 patient, the kit comprising: (a) the plurality of agents; and (b) reagents for detecting a plurality of 

kallikrein markers and optionally CA125, wherein the kallikrein markers comprise or are selected 
from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and 
kallikrein 11. 

30. A kit as claimed in claim 28 or 29 wherein the reagents are antibodies that specifically bind with 
25 protein or protein fragments corresponding to kallikrein markers and optionally CA125. 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 5 
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Figure 6 
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Figure 7 
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Figure 8 
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Sequence Listing 



SEQ ID NO. 1 

CA125 amino acid 

5 1 ralkpsglpgs ssptrslmtg srstkatpera dsgltgatls pktstgaiw tehtlpf tsp 

SI dktlasptss wgrttqslg vmssalpest srgmthseqr tspslspqvn gtpsrnypat 
121 smvsglsspr trtsstegnf tkeastytlt vettsgpvte kytvptetst tegdstetpw 
181 dtryipvkit spmktfadst askenapvsm tpaettvtds htpgrtnpsf gtlyssfldl 
241 spkgtpnsrg etslelilst tgypf sspep gsaghsrist saplsssasv ldnkisetsi 
10 301 fsgqsltspl spgvpearas tmpnsaipf s mtlsnaetsa ervrstissl gtpsistkqt 

361 aetiltfhaf aetmdipsth iaktlasewl gspgtlggts tsaltttsps ttlvseetnt 
421 hhstsgkete gtlnt smtpl etsapgeese mtatlvptlg fttldskirs psqvssshpt 
481 relrttgsts grqssstaah gssdilratt sstskasswt sestaqqf se pqhtqwvets 
541 psmkterppa stsvaapitt svpsvvsgf t tlktsstkgi wleetsadtl igestagptt 
15 601 hqf avptgis mtggsstrgs qgtthlltra tassetsadl tlatngvpvs vspavsktaa 

661 gssppggtkp sytmvssvip etsslqssaf regtslgltp lntrhpf ssp epdsaghtki 
721 stsipllssa svledkvsat stfshhkats sittgtpeis tktkpssavl ssmtlsnaat 
781 spervmats plthpspsge etagsvltls tsaettdspn ihptgtltse ssespstlsl 
841 psvsgvkttf ssstpsthlf tsgeeteets npsvsqpets vsrvrttlas tsvptpvfpt 
20 901 mdtwptrsaq fssshlvsel ratsstsvtn stgsalpkis hltgtatmsq tnrdtfndsa 

961 apqsttwpet sprfktglps atttvstsat slsatvmvsk ftspatssme atsirepstt 
1021 ilttettngp gsmavastiii pigkgyiteg rldtshlpig ttassetsmd ftmakesvsm 
1081 svspsqsmda agsstpgrts qfvdtfsddv yhltsreiti prdgtssalt pqmtathpps 
1141 pdpgsarstw lgilssspss ptpkvtmsst fstqrvttsm imdtvetsrw nmpnlpstts 
25 1201 ltpsniptsg aigkstlvpl dtpspatsle asegglptls typestntps ihlgahasse 

12 61 spstikltma svvkpgsytp ltfpsiethi hv star mays sgssperatap getntgstwd 
1321 pttyitttdp kdtssaqvst phsvrtlrtt enhpktesat paaysgspki ssspnltspa 

13 81 tkawtitdtt ehstqlhytk laekssgfet qsapgpvsw iptsptigss tleltsdvpg 
1441 eplvlapseq ttitlpmatw lstslteema stdldissps sprastfaifp pmstpshels 

30 1501 kseadtsair ntdsttldqh lgirslgrtg dlttvpitpl tttwtsvieh stqaqdtlsa 

1561 traspthvtqs Ikdqtsipas aspshltevy pelgtqgrss seattfwkps tdtlsreiet 
1621 gptniqstpp mdntttgsss sgvtlgiahl pigtsspaet stnmalerrs statvsmagt 
1681 mgllvtsapg rsisqslgrv ssvlsestte gvtdsskgss prlntqgnta lssslepsya 
1741 egsqmstsip ltsspttpdv efiggstfwt kevttvmtsd iskssartes ssatlmstal 

35 1801 gstentgkek lrtasmdlps ptpsmevtpw isltlsnapn ttdsldlshg vhtssagtla 

1861 tdrslntgvt rasrlengsd tsskslsmgn sthtsmtdte ksevsssihp rpetsapgae 
1921 ttltstpgnr aisltlpfss ipveevistg itsgpdinsa pmthspitpp tivwtstgti 
1981 eqstqplhav ssekvsvqtq stpyvnsvav saspthensv ssgsstsspy ssasleslds 
2041 tisrrnaits wlwdlttslp tttwpstsls ealssghsgv snpsstttef plfsaastsa 

40 2101 akqrnpetet hgpqntaast lntdassvtg lsetpvgasi ssevplpmai tsrsdvsglt 

2161 sestanpslg tassagtklt rtislptses lvsfrmnkdp wtvsiplgsh pttntetsip 
2221 vnsagppgls tvasdvidtp sdgaesiptv sfspspdtev ttishfpekt thsfrtissl 
22 81 theltsrvtp ipgdwmssam stkptgasps itlgerrtit saapttspiv ltasftetst 
2341 vsldnettvk tsdildarkt nelpsdssss sdlintsias stmdvtktas isptsisgmt 

45 2401 assspslfss drpqvptstt etntatspsv ssntysldgg snvggtpstl ppftithpve 

2461 tssallawsr pvrtf stmvs tdtasgenpt ssnsvvtsvp apgtwasvgs ttdlpamgfl 
2521 ktspageahs llastiepat aftphlsaav vtgssatsea sllttseska ihsspqtptt 
2581 ptsganwets atpesllvvt etsdttltsk ilvtdtilfs tvstppskfp stgtlsgasf 
2 641 ptllpdtpai pltateptss latsfdstpl vtiasdslgt vpettltmse tsngdalvlk 

50 2701 tvsnpdrsip gitiqgvtes plhpsstsps kivaprntty egsitvalst lpagttgslv 

2761 fsqssenset talvdssagl erasvmpltt gsqgmassgg irsgsthstg tktfsslplt 
2821 mnpgev tarns eittnrltat qstapkgipv kptsaesgll tpvsasssps kafaslttap 
2881 pstwgipqst ltfefsevps ldtksaslpt pgqslntipd sdastasssl skspeknpra 
2 941 rramtstkais assf qstgf t etpegsasps magheprvpt sgtgdpryas esmsypdpsk 

55 3001 assamtstsl asklttlf st gqaarsgsss spislsteke tsf lsptast srktslf lgp 

3061 sraarqpnilv hlqtsaltls ptstlnmsqe eppeltssqt iaeeegttae tqtltftpse 
3121 tptsllpvss pteptarrks spetwassis vpaktslvet tdgtlvttik mssqaaqgns 
3181 twpapaeetg tspagtspgs pevsttlkim sskepsispe irstvrnspw ktpettvpme 
3241 ttvepvtlqs talgsgstsi shlptgttsp tksptenmla tervslspsp peawtnlysg 
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10 
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3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 



tpggtrqsla 
vslstssnil 
deaysstssw 
gktssassvt 
ttittragtns 
nlkvarspgt 
ertlspsdtt 
dpntplstf 1 
ighitsavtp 
phtaktpdat 
sssvlkdpey 
ssehashsti 
dvspymdtss 
lsnfpamtes 
skgpedtsqp 
glgkttdmsr 
yspipghtkp 
ssatetstvi 
mtessgvtit 
vtwtsppsva 
tsmepvtnsp 
sepstatspm 
niilsnvsvg 
tttqtgssga 
qdeasspssq 
sspqsmsntl 
spnvttstme 
eltgattevs 
gphgatsqdt 
spssf lsspa 
stsdkilats 
tvgdpapsts 
Itevsstgvn 
ashstlpldt 
mssparatsps 
therpatykd 
lgdtsvstst 
asrteisssr 
ilpwdtsttl 
mtspspvsst 
ttykdtahte 
svststpaf f 
vitssrttis 
dtsstasweg 
hsslysavsg 
stntetihf s 
gspetknvdr 
tf mpasaqst 
tsagtpsart 
tlqehstssl 
intamanvgt 
ettf slifrl 
mspdtstrsv 
hgf sqldlst 
vhltslptsg 
tttsekesys 
lkgtstsqdp 
pslpislgit 
tvmnkdpeil 
lvmttdtlgt 
gsqssvltds 



tmssvslesp 
edpvtspnsv 
sdqtsgsdit 
spsigletlr 
istttpnpev 
istmhttsf 1 
astpistf sr 
f dslstldwd 
aamarssgvt 
f qrqgqtalt 
aghklgiwdd 
pahsassklt 
ttqtsiissp 
ggmilamqts 
sppsveetss 
islepgtslp 
skatsplvts 
thvssgdatt 
tqtgptgaat 
etsypssltp 
qnlnnpsnei 
vpassmgdal 
aiteatkmev 
tskiplaldt 
apvlvttlps 
ddisvtsaat 
dttisrsipk 
rtdvtsssst 
f tradpsnttp 
mttpslisst 
kdskdtkeif 
mpvhgssett 
ssskistpdh 
sttlsqggth 
pvsstspqsi 
tahteaamhh 
pnisqtnqiq 
tsisdldrpt 
f qggthstvs 
spesipsspl 
amhasmhtnt 
etriqtests 
gpdhskraspy 
thspvtqrf p 
ssptsalpvt 
entaetnmgt 
dstspltpel 
kspdispeas 
qdf vdsetts 
vsvtsvptpt 
tsspnefyf t 
retstsqkig 
tmlstf aglt 
Itsrvpeyis 
lvkttdmlas 
svpayseppk 
ivsteksavl 
essnmtiitr 
swtippsiek 
spepttsspp 
ektkatapmd 



tarsitgtgq 
ssltdkskhk 
Igaspdvtnt 
anvsavksdi 
gmstmdstpa 
assteldsras 
vqrmsisvpd 
tgrslssata 
f srpdptskk 
tearatsdsw 
f ipkf gkaah 
spvvttstre 
gstaitkgpr 
ppgatslsap 
ssslvpihat 
pnlsstagea 
himgditsst 
hvtktqatf s 
qgpylldtst 
f lvttippat 
latlaattdi 
asisipgset 
psf datf ipt 
stletsagtp 
svaf tpqwhs 
tdietthpsi 
sskttrtete 
sf pgpdqstv 
qagihsamth 
lpedklsspm 
psinteetnv 
nikreptyf 1 
dkstvppdtf 
stvtqgfpys 
pssplpvtal 
stntavtnvg 
teptaslspr 
iapdistgrai 
qgf phseitt 
pvtalltsvl 
avanvgtsis 
slipglrdtr 
istetitrls 
hseetttmsr 
slltsgrrkt 
tnsmhklhss 
kedstalvmn 
sshsnspplt 
vmnndlndvl 
lakitdmdtn 
vspdsdpyka 
sssdtstvf d 
kseertiatq 
gtsppsvekt 
vaslppnlgs 
vtspravtsf n 
hklttgatet 
tgpplgstsq 
tsf ssslmps 
nlsstshvil 
ttstmghttv 



qsspelvskt 
tetwvsttai 
lyitstaqtt 
aptaghlsqt 
terrttsteh 
tphgritvig 
ilstswtpss 
ttsapqgatt 
aeqtstqlpt 
nekekstpsa 
mrelpllspp 
qaivsmsttt 
teitsskris 
tldtsatasw 
tspsnillts 
lstyeasrdt 
svf gssette 
sgtsissphq 
mpyltetpla 
stlqgqhtss 
etihpsinka 
tdiegeptss 
paqstkfpdi 
svvtegf ahs 
tsspvsmssv 
ntvvtnvgtt 
ttssltpklr 
sldistetnt 
gf sqldvttl 
tslltsglvk 
kannsghesh 
tprlretsts 
tgeiprvf ts 
evttlmgmgp 
ptsvlvtttd 
tsgsghksqs 
lresstsekt 
trlf tspirat 
lrsrtpgdvs 
vtttnvlgtt 
ghesqssvpa 
tseeintvte 
tf pfvtgste 
stkgvswqsp 
idmldthsel 
vsihsqpsgh 
sttesntvf s 
isthktiatq 
ktspf saeea 
lepvtrspqn 
tsawitsts 
kaf taattev 
tgphratsqg 
sssssllslp 
tshkipttse 
irdtivstsm 
srtevassrr 
gtf tldtptt 
pamtsppvss 
ttdedttaie 
stsmsvsset 



tgmef smwhg 
pstvlnnkim 
slvslpsgdq 
sspaevsild 
pstwsstaas 
tslvtpssda 
teaedvpvsm 
pqeltletmi 
ttsahpgqvp 
pwitemrnnsv 
qdkeaihpst 
wpestrarte 
ssflaqsmrs 
tgtplattqr 
qghspsstpp 
kaihhsadta 
ietvssvnqg 
f itstntf td 
vtpdfmqsek 
pvsatsvlts 
vtnmgtassa 
ltagrkenst 
f svassrlsn 
kittamnndv 
ltsslvktag 
gsaf eshstv 
etsisqeits 
rlstspimte 
msripqdvsw 
xtdilrtrle 
spaladsetp 
qessfptdts 
siktksaemt 
gnvswmttpp 
vlgttspesv 
svladsetsk 
ssttetntaf 
ksaemtvttq 
wmttppveet 
spetvtsspp 
dshtskatsp 
tstvlsevpt 
maitnqtgpi 
psveetssps 
vtsslpsass 
tppkvtgsram 
svsldaatev 
tgpsgvtslg 
nslssqapll 
lrntlatsea 
gdsivstsmp 
srteltsssr 
tltwdtsitt 
aitspspvpt 
dikdtekmyp 
pgsseitrie 
tsipgpdhst 
ssragthsma 
tlpktihttp 
amhpststaa 
tkikrestys 



stggttgdth 
aaeqqtsrsv 
gitsltnpsg 
vttaptpgis 
dswtvtdmts 
savktetsts 
vstdhastkt 
spatsqlpf s 
rsaattldvi 
sedtikevts 
ntvettgwvt 
pnsf Itielr 
sdspseaitr 
f tysekttlf 
vtsvflsets 
vtnmeatsse 
lqerstsqva 
vstnpstsli 
ttliskgpkd 
glvkttdmln 
hvlhstlpvs 
lqemnsttes 
sppmtisthra 
kdvsqtnppf 
kvdtsletvt 
saypepskvt 
stetstvpyk 
saeitittqt 
tsppsvdkts 
pvtsslpnf s 
kattqmvitt 
f llskvptgt 
ittqasppes 
veetssvssl 
tssppnlssi 
atplmsttst 
syvptgaitq 
tttpgatsqg 
ssgf slmsps 
nlssptqerl 
mgitf amgdt 
ttttevsrte 
gtisqatltl 
spvplpaits 
f sgeiltsea 
edaivststp 
sraevtyydp 
qltldtstia 
vttspspvts 
ttdthtmhps 
rssamkkies 
tsiqgtekpt 
sqagthsamt 
tlpesrpssp 
stniavtnvg 
mestf svahg 
espdistevi 
tqef phsemt 
sprat slltps 
tnvettcsgh 
ltpglretsi 
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6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 



sqnasf stdt 
sptmtesaera 
pgdmswqssp 
dmlhtspelv 
aladsetska 
saietsavls 
aessemtikt 
prpsslpvee 
asaepetssp 
settkatsara 
antvlskvst 
mtittqtgps 
pfvketspps 
gtssssslst 
ttstmgdtsv 
atteiskedv 
tstlatsstt 
attspspvss 
atsevttdte 
ststpgf f et 
tsssrtsisg 
isttpnwvet 
spvsstlved 
tvdtedmqps 
df f etsriqi 
tsmsgpdqf t 
f wsgthstas 
lpesisssph 
ihpssntpw 
mtqptsslts 
stispeiste 
athrsphsgm 
splrvtslf t 
avtqmgtisa 
tltptprets 
sevitrlsts 
mtalmsrtpg 
tsglvkttel 
ssvladsvtt 
tssatekstv 
rkestdraait 
vswpsplsve 
daslepetts 
f septkvisp 
etstvlyktns 
eitittqtgy 
rf vettrsss 
ktssspnlss 
tipsmgitsa 
ygvptsatte 
qksspgataq 
sssssllslp 
tsveilaase 
maetsistsm 
sktdf tssak 
thhlstdllp 
tiaeslpsst 
vstldtssqp 
eaahrgtirp 
ytptlgtltp 
svlnresett 



sivlsevptg 
tiptqtgpsg 
slenpsslps 
tssppklsht 
tspmf itstq 
evsigattei 
qtsppgstse 
tsppssqlsl 
pslsstsvei 
gtisimgdts 
gattevsrte 
estlestlnl 
splslpavts 
tsherlttyk 
Itstpaf let 
tsipgpaqst 
sltmthstis 
tlpesisssp 
kthpssnrtv 
sriqteptss 
f aqltvspet 
hstvtqrf sh 
fpsaslpvts 
tht avtnvrt 
eptssltsgl 
ispdisteai 
pgf shsemtt 
pvtalltlgp 
nvgtviykhl 
glreistsqe 
titristplt 
ttpmsrgped 
pvmmkttdml 
rqefyssypg 
tsqeihsatk 
pikaestemt 
dvpwlshpsv 
lgtssepets 
katssmgity 
lssvptgatt 
pktgpsgats 
knsppsslvs 
apnranitsde 
wtsssirdn 
sgatpevsrt 
slatsqvtlp 
sltslpltts 
tsveipatse 
vddttvf tsn 
vsrateimssn 
stltlattta 
vttspsvsst 
vttdtekihp 
panf ettgf e 
tsspdwppas 
saetistgtv 
pvpf ssstf t 
grtsstpild 
vkgpqtstsp 
lnasrqmast 
aslvsrsgae 



ttaevsrtev 
stsqdtltld 
llslpattsp 
sderlttgkd 
edttvaistp 
srtevtsssr 
stftldtstt 
samispspvs 
latsevttdt 
vstltpalsn 
aisf srtsms 
ntattpswve 
phpvsttf la 
dtahteavhp 
rriqtelass 
ispdistrtv 
qgf shsqmst 
lpvtslltsg 
tdvgtsssgh 
ltlglrktss 
stetitrlpt 
semttlvsrs 
lltpglvitt 
sisghesqss 
retssseris 
trlstspimt 
lmsrtpgdvp 
vkttdralrts 
spssvladlv 
tssatersas 
ttgsaeratit 
vswpsrpsve 
dtslepvtts 
Ipepskvtsp 
pstvpykalt 
ittqtgspga 
eeassasf si 
sppnlsstsa 
ptgdtnvlts 
evsrteaiss 
qgtf tldsss 
sssvtspspl 
slatskatte 
mvsttmpgss 
evmpssrtsi 
Igtsmtf lsg 
lspvsstlld 
imtdtekihp 
paf setrrip 
rthipdsdqs 
plarthstvp 
lpqsipsssf 
s s smavtnvg 
aepf shltsg 
qyteipvdii 
mpslseamts 
ttdsstipal 
trmtesvelg 
aspkglhtgg 
iltemmittp 
rspviqtldv 



tssgrtsipg 
tsttksqakt 
ppisstlpvt 
ttnteavhps 
hf letsriqk 
tsisgsaest 
pslvithstm 
stlpasshss 
ekihpfpnta 
trkiqsepas 
gpeqstmsqd 
thsiviqgfp 
hippsplpvt 
stntggtnva 
ltpglressg 
swf stspvmt 
lmrrgpedvs 
lakttdmlhk 
estsf vlads 
segtslatem 
ssimtesaem 
pgdmlwpsqs 
drmgisrepg 
vlsdsetpka 
sategstvls 
esaesaitie 
wpslpsveea 
sepetssppn 
ttkptspmat 
lsgmptgatt 
pktghsgass 
ktsppsslvs 
ppsmnitsde 
vvtsstikdi 
satiedsmtq 
tsrgtltldt 
sspvmtsssp 
eilattevtt 
tpaf sdtsri 
srtsipgpaq 
taswpgthsa 
ystpsgsshs 
teaihvf ent 
gitrieiesm 
pgpaqstmsl 
thstmsqgls 
sspssplpvt 
ssntavakvr 
teptf sltpg 
tmspdiitev 
prf Ihsemtt 
svtslltpgm 
ttssghelys 
frktnmsldt 
tpfnaspsit 
f attgvprai 
heitsssatp 
tvtsayqvps 
tkrmetttta 
yvfpdvpett 
sssepdttas 



psqstvlpei 
hstltqrfph 
isssplpvts 
tntaasnvei 
esisslspkl 
mlpeisttrk 
tqrlphseit 
sasvtspltp 
vtkvgtsssg 
slttrlrets 
isigtipris 
hpemttsragr 
slltsgpatt 
ttssgyksqs 
segtssgtkm 
esaeitmnth 
wmsppllekt 
ssepvtnspa 
qtskvtsprav 
stvlsgvptg 
raiktqtdppg 
sveetssass 
tsstsnlsst 
tspmgttytm 
evpsgattev 
tgspgatseg 
ssvssslssp 
lsstsaeila 
tstlgntsvs 
kvsrtealsl 
qgtf tldtss 
lsavtspspl 
slatskatme 
vsttipasse 
vmsssrgpsp 
sttfmsgths 
vsstlpdsih 
dteklemtnv 
qtksklsltp 
stmssdt srae 
ttqrf pqsw 
spvpvtslf t 
aashvettsa 
ssltpglret 
disdevvtrl 
hsemtnlrasr 
slilpglvkt 
tsssvheshs 
f retstseet 
itrlssssmm 
Imsrspenps 
vkttdtstep 
svsihsepsk 
ssvtptntps 
estgitsfpe 
sgsgspf srt 
yrvdtslgte 
lstrltrtdg 
lkttttalkt 
sslatslgae 
wvihpaetip 



strtmtrlf a 
semttlmsrg 
lltsspvttt 
psfghespss 
retgssvets 
iikfptspil 
tlvsrgagdv 
gqvkttevld 
hespssvlpd 
tseetslate 
assvltesak 

gpggvswpsp 

tdilgtstep 
svladsspmc 
stvlskvptg 
tsplgattqg 
rpsf slmssp 
nlsstsveil 
itstraedtsv 
ataevsrtev 
stpesthtvd 
llslpattsp 
sherlttled 
getsvsists 
srtevissrg 
tltldtsttt 
amtstsf f sa 
tsevtkdrek 
tstpafpetm 
grtstpgpaq 
raswpgthsa 
ystpsesshs 
teaiqlsent 
itriemests 
dqstmsqdis 
tasqgf shsq 
ssslpvtsll 
vtsgythesp 
glmetsisee 
titristplt 
ttpmsrgped 
simmkatdml 
teelyssspg 
rtsqditsst 
stspimtesa 
gpeslswtsp 
tevl dtssep 
svladsetti 
tsitetsavl 
sestqmtitt 
wksspfvekt 
gtslspnlsg 
atypvgtpss 
spgsthllqs 
srf tmsvtes 
esgpgdatls 
ssttegrlvm 
imehitkipn 
tsratlttsv 
tstalprttp 
tvskttpnf f 
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10621 hseldtvsst atshgadvss aiptnispse ldaltplvti sgtdtsttfp tltksphete 
10681 trttwlthpa etsstiprti pnfshhesda tpsiatspga etssaipimt vspgaedlvt 
10741 sqvtssgtdr nmtiptltls pgepktiasl vthpeaqtss aiptstispa vsrlvtsravt 
10801 slaaktsttn raltnspgep attvslvthp aqtsptvpwt tsiffhsksd ttpsmttshg 
5 10861 aesssavptp tvstevpgw tplvtssrav isttipiltl spgepettps matshgeeas 

10921 saiptptvsp gvpgvvtslv tssravtstt ipiltfslge pettpsraats hgteagsavp 
10981 tvlpevpgmv tslvassrav tsttlptltl spgepettps matshgaeas stvptvspev 
11041 pgvvtslvts ssgvnstsip tlilspgele ttpsmatshg aeassavptp tvspgvsgvv 
11101 tplvtssrav tsttipiltl sssepettps matshgveas savltvspev pgmvtslvts 
10 11161 sravtsttip tltissdepe tttslvthse akmisaiptl avsptvqglv tslvtssgse 

11221 tsafsnltva ssqpetidsw vahpgteass wptltvstg epftnislvt hpaessstlp 
11281 rttsrf shse ldtmpstvts peaesssais ttispgipgv ltslvtssgr disatfptvp 
11341 espheseata swvthpavts ttvprttpny shsepdttps iatspgaeat sdfptitvsp 
11401 dvpdmvtsqv tssgtdtsit iptltlssge petttsf ity sethtssaip tlpvspgask 
15 11461 mltslvissg tdstttfptl tetpyepett aiqlihpaet ntmvpkttpk f shsksdttl 

11521 pvaitspgpe assavsttti spdmsdlvts Ivpssgtdts ttfptlsetp yepettvtwl 
11581 thpaetsttv sgtipnf shr gsdtapsmvt spgvdtrsgv ptttippsip gwtsqytss 
11641 atdtstaipt ltpspgepet tassathpgt qtgf tvpirt vpssepdtma swvthppqts 
11701 tpvsrttssf shsspdatpv matsprteas savlttispg apemvtsqit ssgaatsttv 
20 11761 ptlthspgmp ettallsthp rtgtsktfpa stvfpqyset tasltirpga etstalptqt 

11821 tsslftllvt gtsrvdlspt aspgvsakta plsthpgtet stmiptstls lgllettgll 
11881 atsssaetst stltltvspa vsglssasit tdkpqtvtsw ntetspsvts vgppefsrtv 
11941 tgttmtlips emptppktsh gegvspttil rttmveatnl attgssptva kttttfntla 
12 001 gslftplttp gmstlasesv tsrtsynhrs wisttssynr rywtpatstp vtstfspgis 
25 12 061 tssipsstaa tvpfmvpftl nftitrxlqye edmrhpgsrk fnaterelqg llkplfrnss 

12121 leylysgcrl aslrpekdss amavdaicth rpdpedlgld rerlywelsn Itngiqelgp 
12181 ytldrnslyv ngfthrssmp ttstpgtstv dvgtsgtpss spsptaagpl lmpftlnfti 
12241 tnlqyeedmr rtgsrkfntm esvlqgllkp lfkntsvgpl ysgcrltllr pekdgaatgv 
123 01 daicthrldp kspglnreql ywelskltnd ieelgpytld rnslyvngft hqssvsttst 
12361 pgtstvdlrt sgtpsslssp timaagpllv pftlnftitn lqygedmghp gsrkfntter 
12421 vlqgllgpif kntsvgplys gcrltslrse kdgaatgvda icihhldpks pglnrerlyw 
12481 elsqltngik elgpytldrn slyvngfthr tsvpttstpg tstvdlgtsg tpfslpspat 
12541 agpllvlftl nftitnlkye edmhrpgsrk fnttervlqt llgpmfknts vgllysgcrl 

12 601 tllrsekdga atgvdaicth rldpkspgld reqlywelsq ltngikelgp ytldrnslyv 
35 12 661 ngfthwipvp tsstpgtstv dlgsgtpssl psptaagpll vpf tlnf tit nlqyeedmhh 

12721 pgsrkfntte rvlqgllgpm fkntsvglly sgcrltllrs ekdgaatgvd aicthrldpk 
12781 spgvdreqly welsqltngi kelgpytldr nslyvngfth qtsapntstp gtstvdlgts 
12841 gtpsslpspt sagpllvpft lnftitnlqy eedmrhpgsr kfnttervlq gllkplfkst 
12901 svgplysgcr ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg 
40 12961 pytldrnsly vngfthqtsa pntstpgtst vdlgtsgtps slpsptsagp llvpftlnft 

13 021 itnlqyeedm hhpgsrkfnt tervlqgllg pmfkntsvgl lysgcrltll rpekngaatg 
13081 mdaicshrld pkspglnreq lywelsqlth gikelgpytl drnslyvngf thrssvapts 
13141 tpgtstvdlg tsgtpsslps pttavpllvp ftlnftitnl qygedmrhpg srkfntterv 

132 01 lqgllgplfk nssvgplysg crlislrsek dgaatgvdai cthhlnpqsp gldreqlywq 
45 13261 lsqmtngike lgpytldrns lyvngfthrs sglttstpwt stvdlgtsgt pspvpsptta 

13321 gpllvpftln ftitnlqyee dmhrpgsrkf nttervlqgl lspifknssv gplysgcrlt 

133 81 slrpekdgaa tgmdavclyh pnpkrpgldr eqlywelsql thnitelgpy sldrdslyvn 
13441 gfthqnsvpt tstpgtstvy wattgtpssf pghtepgpll ipftfnftit nlhyeenmqh 
13 5 01 pgsrkfntte rvlqgllkpl fkntsvgply sgcrltslrp ekdgaatgmd avclyhpnpk 

50 13 5 61 rpgldreqly welsqlthni telgpysldr dslyvngfth qnsvpttstp gtstvywatt 

13621 gtpssfpght epgpllipft fnftitnlhy eenmqhpgsr kfnttervlq gllkplfknt 
13681 svgplysgcr ltllrpekhe aatgvdtict hrvdpigpgl drerlywels qltnsitelg 
13 741 pytldrdsly vngfnprssv pttstpgtst vhlatsgtps slpghtapvp llipftlnft 
13 8 01 itnlhyeenm qhpgsrkfnt tervlqgllk plfkntsvgp lysgcrltll rpekheaatg 
55 13861 vdticthrvd pigpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 

13 921 tpgtstvxxg tsgtpssxpx xtsagpllvp ftlnftitnl qyeedmhhpg srkfntterv 
13981 lqgllgpmfk ntsvgllysg crltllrpek ngaatgmdai cshrldpksp gldreqlywe 
14041 lsqlthgike lgpytldrns lyvngfthrs svaptstpgt stvdlgtsgt psslpsptta 
14101 vpllvpf tin ftitnlqyge dmrhpgsrkf nttervlqgl lgplfknssv gplysgcrli 
14161 slrsekdgaa tgvdaicthh lnpqspgldr eqlywqlsqm tngikelgpy tldrnslyvn 
14221 gfthrssglt tstpwtstvd lgtsgtpspv pspttagpll vpftlnftit nlqyeedmhr 
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142 81 pgsrkfiiate rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk 
14341 rpgldreqly welsqlthni telgpysldr dslyvngfth qssmtttrtp dtstmhlats 
144 01 rtpaslsgpt taspllvlft inctitnlqy eedmrrtgsr kf ntmesvlq gllkplfknt 
14461 svgplysgcr ltllrpkkdg aatgvdaict hrldpkspgl nreqlywels kltndieelg 
5 14521 pytldrnsly vngfthqssv sttstpgtst vdlrtsgtps slssptimxx xpllxpf txn 

14581 xtitnlxxxx xmxxpgsrkf nttervlqgl lrplfkntsv sslysgcrlt llrpekdgaa 
14641 trvdaactyr pdpkspgldr eqlywelsql thsitelgpy tldrvslyvn gfnprssvpt 
14701 tstpgtstvh latsgtpssl pghtxxxpll xpftxnxtit nlxxxxxmxx pgsrkfntte 
14761 rvlqgllkpl f rnssleyly sgcrlaslrp ekdssamavd aicthrpdpe dlgldrerly 

10 14821 we 1 snl t rig i qelgpytldr nslyvngfth rssglttstp wtstvdlgts gtpspvpspt 

14881 tagpllvpft Inftitnlqy eedmhrpgsr rfnttervlq glltplfknt svgplysgcr 
14941 ltllrpekqe aatgvdtict hrvdpigpgl drerlywels qltnsitelg pytldrdsly 
15001 vngfnpwssv pttstpgtst vhlatsgtps slpghtapvp llipftlnft itdlhyeenm 
15061 qhpgsrkf nt tervlqgllk plf kstsvgp lysgcrltll rpekhgaatg vdaictlrld 

15 15121 ptgpgldrer lywelsqltn svtelgpytl drdslyvngf thrssvptts ipgtsavhle 

15181 tsgtpaslpg htapgpllvp ftlnftitnl qyeedmrhpg srkfstterv lqgllkplfk 
15241 ntsvsslysg crltllrpek dgaatrvdav cthrpdpksp gldrerlywk lsqlthgite 
153 01 lgpytldrhs lyvngfthqs smtttrtpdt stmhlatsrt paslsgptta spllvlftin 
153 61 ftitnlryee nmhhpgsrkf nttervlqgl Irpvfkntsv gplysgcrlt tlrpkkdgaa 

20 15421 tkvdaictyr pdpkspgldr eqlywelsql thsitelgpy tqdrdslyvn gfthrssvpt 

15481 tsipgtsavh letsgtpasl pghtapgpll vpf tlnftit nlqyeedmrh pgsrkfntte 
15541 rvlqgllkpl fkstsvgply sgcrltllrp ekrgaatgvd ticthrldpl npgldreqly 
15601 welskltrgi ielgpylldr gslyvngfth rtsvpttstp gtstvdlgts gtpfslpspa 
15661 xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttervlq tllgpmfknt svgllysgcr 

25 15721 ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg pytldrnsly 

15781 vngf thwipv ptsstpgtst vdlgsgtpss Ipspttagpl Ivpftlnfti tnlkyeedmh 
15841 cpgsrkfntt ervlqsllgp rafkntsvgpl ysgcrltllr sekdgaatgv daicthrldp 
15901 kspgvdreql ywelsqltng ikelgpytld rnslyvngft hqtsapntst pgtstvdlgt 
15961 sgtpsslpsp txxxpllxpf txnxtitnlx xxxxmxxpgs rkfnttexvl qgllxpxfkn 

30 16021 xsvgxlysgc rltxlrxekx gaatgxdaic xhxxxpkxpg Ixxexlywel sxltxxixel 

16081 gpytldrxsl yvngfthwip vptsstpgts tvdlgsgtps slpspttagp llvpftlnft 
16141 itnlkyeedm hcpgsrkfnt tervlqsllg pmfkntsvgp lysgcrltsl rsekdgaatg 
16201 vdaicthrvd pkspgvdreq lywelsqltn gikelgpytl drnslyvngf thqtsapnts 
16261 tpgtstvxxg tsgtpssxpx xtsagpllvp ftlnftitnl qyeedrahhpg srkfntterv 

35 16321 lqgllgprafk ntsvgllysg crltllrpek ngattgmdai cthrldpksp glxxexlywe 

16381 lsxltxxixe lgpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt pssxpxxtxx 
16441 xpllxpf txn xtitnlxxxx xmxxpgsrkf nttervlqgl lkplfrnssl eylysgcrla 
16501 slrpekdssa mavdaicthr pdpedlgldr erlywelsnl tngiqelgpy tldrnslyvn 
16561 gfthrssmpt tstpgtstvd vgtsgtpsss pspttagpll ipftlnftit nlqygedmgh 

40 16621 pgsrkfntte rvlqgllgpi fkntsvgply sgcrltslrs ekdgaatgvd aicihhldpk 

16681 spglnrerly welsqltngi kelgpytldr nslyvngfth rtsvpttstp gtstvdlgts 
16741 gtpfslpspa tagpllvlft Inftitnlky eedmhrpgsr kfnttervlq tllgpmfknt 
16801 svgllysgcr ltllrsekdg aatgvdaict hrldpkspgl xxexlywels xltxxixelg 
16861 pytldrxsly vngf thxxsx pttstpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt 

45 16921 itnlxxxxxm xxpgsrkfnt tervlqgllr pvfkntsvgp lysgcrltll rpkkdgaatk 

16981 vdaictyrpd pkspgldreq lywelsqltn sitelgpytq drdslyvngf thrssvptts 
17 041 ipgtsavhle ttgtpssfpg htepgpllip ftfnftitnl ryeenmqhpg srkfntterv 
17101 Iqglltplfk ntsvgplysg crltllrpek qeaatgvdti cthrvdpigp gldrerlywe 
17161 lsqltnsite lgpytldrds lyvdgfnpws svpttstpgt stvhlatsgt psplpghtap 

50 17221 vpllipftln ftitdlhyee nmqhpgsrkf nttervlqgl lkplfkstsv gplysgcrlt 

172 81 llrpekhgaa tgvdaictlr Idptgpgldr erlywelsql tnsitelgpy tldrdslyvn 
17341 gfnpwssvpt tstpgtstvh latsgtpssl pghttagpll vpftlnftit nlkyeedmhc 
17401 pgsrkfntte rvlqslhgpm fkntsvgply sgcrltllrs ekdgaatgvd aicthrldpk 
17461 spglxxexly welsxltxxi xelgpytldr xslyvngfth xxsxpttstp gtstvxxgts 

55 17521 gtpssxpxxt xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxf knx 

17581 svgxlysgcr Itxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltnsitelg 
17641 pytldrdsly vngfthrssm pttsipgtsa vhletsgtpa slpghtapgp llvpftlnft 
177 01 itnlqyeedm rhpgsrkfnt tervlqgllk plf kstsvgp lysgcrltll rpekrgaatg 

177 61 vdticthrld plnpglxxex lywelsxl tx xixelgpytl drxslyvngf thxxsxptts 
60 17821 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxmxxpg srkfnttexv 

178 81 lqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe 
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17941 lsxltxxixe lgpytldrxs lyvngfhprs svpttstpgt stvhlatsgt psslpghtap 



18001 



vpllipftln ftitnlhyee nmqhpgsrkf nttervlqgl lgpmfkntsv gllysgcrlt 
18061 llrpekngaa tgmdaicshr ldpkspglxx exlywelsxl txxixelgpy tldrxslyvn 
18121 gfthxxsxpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxmxx 
5 18181 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk 

18241 xpglxxexly welsxltxxi xelgpytldr xslyvngfth qnsvpttstp gtstvywatt 
18301 gtpssfpght epgpllipft fnftitnlhy eenmqhpgsr kfnttervlq glltplfknt 
18361 svgplysgcr Itllrpekqe aatgvdtict hrvdpigpgl xxexlywels xltxxixelg 
18421 pytldrxsly vngfthxxsx pttstpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt 
10 18481 itnlxxxxxm xxpgsrkfnt texvlqgllx pxfknxsvgx lysgcrltxl rxekxgaatg 

18541 xdaicxhxxx pkxpglxxex lywelsxltx xixelgpytl drxslyvngf thrssvptts 
18601 spgtstvhla tsgtpsslpg htapvpllip ftlnftitnl hyeenmqhpg srkfntterv 
18 661 Iqgllkplfk stsvgplysg crltllrpek hgaatgvdai ctlrldptgp glxxexlywe 

18 721 lsxltxxixe lgpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt pssxpxxtxx 
15 18781 xpllxpftxn xtitnlxxxx xmxxpgsrkf nttexvlqgl lxpxfknxsv gxlysgcrlt 

18841 xlrxekxgaa tgxdaicxhx xxpkxpglxx exlywelsxl txxixelgpy tldrxslyvn 
18901 gfthrtsvpt tstpgtstvh latsgtpssl pghtapvpll ipftlnftit nlqyeedmhr 
18961 pgsrkfntte rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk 

19 021 rpgldreqly celsqlthni telgpysldr dslyvngfth qnsvpttstp gtstvywatt 
20 19081 gtpssfpght xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxfknx 

19141 svgxlysgcr ltxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltxxixelg 
19201 pytldrxsly vngfthwssg lttstpwtst vdlgtsgtps pvpspttagp llvpftlnft 
192 61 itnlqyeedm hrpgsrkfna tervlqglls pifkntsvgp lysgcrltll rpekqeaatg 
19321 vdticthrvd pigpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 

25 19381 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxraxxpg srkfnttexv 

19441 lqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe 
19501 lsxltxxixe lgpytldrxs lyvngfthrs fglttstpwt stvdlgtsgt pspvpsptta 
19561 gpllvpftln ftitnlqyee dmhrpgsrkf nttervlqgl Itplfrntsv sslysgcrlt 
19621 llrpekdgaa trvdavcthr pdpkspglxx exlywelsxl txxixelgpy tldrxslyvn 

30 19681 gfthxxsxpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxmxx 

19741 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk 
198 01 xpglxxexly welsxltxxi xelgpytldr xslyvngfth wipvptsstp gtstvdlgsg 
198 61 tpsslpsptt agpllvpftl nftitnlqyg edmghpgsrk fnttervlqg llgpifknts 
19921 vgplysgcrl tslrsekdga atgvdaicih hldpkspglx xexlywelsx ltxxixelgp 

35 19981 ytldrxslyv ngfthxxsxp ttstpgtstv xxgtsgtpss xpxxtxxxpl Ixpftxnxti 

20041 tnlxxxxxrax xpgsrkfntt exvlqgllxp xfknxsvgxl ysgcrltxlr xekxgaatgx 
2 0101 daicxhxxxp kxpglxxexl ywelsxltxx ixelgpytld rxslyvngft hqtfapntst 
2 0161 pgtstvdlgt sgtpsslpsp tsagpllvpf tlnftitnlq yeedmhhpgs rkfnttervl 
20221 qgllgpmfkn tsvgllysgc rltllrpekn gaatrvdavc thrpdpkspg Ixxexlywel 

40 2 02 81 sxltxxixel gpytldrxsl yvngfthxxs xpttstpgts tvxxgtsgtp ssxpxxtapv 

2 0341 pllipftlnf titnlhyeen mqhpgsrkfn ttervlqgll kplfkstsvg plysgcrltl 
20401 lrpekhgaat gvdaictlrl dptgpgldre rlywelsqlt nsvtelgpyt ldrdslyvng 
20461 ftqrssvptt sipgtsavhl etsgtpaslp ghtapgpllv pftlnftitn lqyevdmrhp 
2 0521 gsrkfntter vlqgllkplf kstsvgplys gcrltllrpe krgaatgvdt icthrldpln 

45 20581 pgldreqlyw elskltrgii elgpylldrg slyvngfthr nfvpitstpg tstvhlgtse 

20641 tpsslprpiv pgpllvpftl nftitnlqye eamrhpgsrk fnttervlqg llrplfknts 
2 0701 igplysscrl tllrpekdka atrvdaicth hpdpqspgln reqlywelsq lthgitelgp 
2 0761 ytldrdslyv dgfthwspip ttstpgtsiv nlgtsgipps lpettxxxpl Ixpftxnxti 
20821 tnlxxxxxrax xpgsrkfntt ervlqgllkp lfkstsvgpl ysgcrltllr pekdgvatrv 

50 20881 daicthrpdp kipgldrqql ywelsqlths itelgpytld rdslyvngft qrssvpttst 

2 0941 pgtftvqpet setpsslpgp tatgpvllpf tlnftitnlq yeedrahrpgs rkfnttervl 
21001 qgllmplfkn tsvsslysgc rltllrpekd gaatrvdavc thrpdpkspg ldrerlywkl 
21061 sqlthgitel gpytldrhsl yvngfthqss mtttrtpdts tmhlatsrtp aslsgpttas 
21121 pllvlftinf titnlryeen mhhpgsrkfn ttervlqgll rpvfkntsvg plysgcrltl 

55 21181 lrpkkdgaat kvdaictyrp dpkspgldre qlywelsqlt hsitelgpyt ldrdslyvng 

21241 ftqrssvptt sipgtptvdl gtsgtpvskp gpsaaspllv Iftlnftitn lryeenmqhp 
21301 gsrkfntter vlqgllrslf kstsvgplys gcrltllrpe kdgtatgvda icthhpdpks 
213 61 prldreqlyw elsqlthnit elghyaldnd slfvngfthr ssvsttstpg tptvylgask 
21421 tpasifgpsa ashllilftl nftitnlrye enmwpgsrkf nttervlqgl Irplfkntsv 

60 21481 gplysgsrlt llrpekdgea tgvdaicthr pdptgpgldr eqlylelsql thsitelgpy 

21541 tldrdslyvn gfthrssvpt tstgvvseep ftlnftinnl rymadmgqpg slkfnitdnv 
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21601 mkhllsplfq rsslgarytg crvialrsvk ngaetrvdll ctylqplsgp glpikqvfhe 

21661 lsqqthgitr lgpysldkds lylngynepg Ideppttpkp attflpplse attamgyhlk 

21721 tltlnftisn lqyspdmgkg satfnstegv lqhllrplfq kssmgpfylg cqlislrpek 

21781 dgaatgvdtt ctyhpdpvgp gldiqqlywe lsqlthgvtq lgfyvldrds lfingyapqn 

5 21841 lsirgeyqin fhivnwnlsn pdptsseyit llrdiqdkvt tlykgsqlhd tfrfclvtnl 

21901 tmdsvlvtvk alfssnldps lveqvf Idkt lnasf hwlgs tyqlvdihvt ernes svyqpt 

21961 sssstqhfyl nftitnlpys qdkaqpgttn yqrnkrnied alnqlf rnss iksyf sdcqv 

22021 stf rsvpnrh htgvdslcnf splarrvdrv aiyeef lrmt rngtqlqnf t ldrssvlvdg 

22 081 yspnrneplt gnsdlpfwav iliglagllg litclicgvl vttrrrkkeg eynvqqqcpg 
10 22141 yyqshldled lq 
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CA125 nucleic acid Genbank No. AF414442 



CDS 205. . 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 



66663 
aagcgttgca 
cccaggtcaa 

gggagctcag 

ccagaggcaa 
cccacccgct 
ggactgacag 
catactctgc 
ggaagaacca 
ggaatgacac 
ccctctagga 
accagttcca 
accacaagtg 
ggtgacagca 
atgaaaacat 
gctgagacca 
ctttattctt 
agcctggaac 
gcaggacaca 
aataaaatat 
ggggtgcccg 
ctaagcaatg 
ccatcaatat 
accatggata 
ccaggtaccc 
ttagtctcag 
ttgaatacat 
gccaccttgg 
caggtctctt 
cagagttcca 
acctcaaaag 
cacacacagt 
agtgtggcag 
aagaccagct 
gaatccacag 
ggaggcagca 
tcatctgaga 
ccagcagtga 
acaatggttt 
ggaaccagcc 
gactctgcag 
cttgaggata 
accacaggga 
atgaccctaa 



caattccccc 
atgcggggac 
aagctctgca 
atgttgacct 
ccttgatgac 
gagccacctt 
cctttacttc 
cccagtcttt 
actccgagca 
actaccctgc 
cagaaggaaa 
gcccagtcac 
cagagacccc 
ttgcagattc 
cagttactga 
ccttccttga 
tgattctatc 
gcagaataag 
cagagaccag 
aggccagagc 
cagaaacaag 
ccacaaagca 
tacccagcac 
ttggtggcac 
aggagaccaa 
ctatgactcc 
tccccactct 
catcccaccc 
gcacagctgc 
catcatcatg 
gggtggagac 
cccctatcac 
ccacaaaagg 
ctggcccaac 
gcaccagggg 
catccgcaga 
gcaagacggc 
cttctgtcat 
tgggactgac 
gacacaccaa 
aagtgtcagc 
ctcctgaaat 
gcaatgcagc 



aacctccata 
cccagccata 
gaggccaccc 
gtccatgctg 
agggagcagg 
gtcacctaag 
cccagataag 

gggggtgatg 

aagaaccagc 
tacaagcatg 
ttttaccaaa 
tgagaagtac 
ctgggacaca 
aactgcatcc 
ctcacatact 
cctatcacct 
aaccactgga 
taccagtgcg 
catattctca 
cagcacaatg 
tgccgaaagg 
gacagcagag 
ccacatagcc 
cagcacttca 
cacccatcac 
acttgagacc 
aggttttaca 
aacaagagag 
ccacgggagc 
gaccagtgaa 
aagtcctagc 
cacttctgtt 
gatttggctt 
cacccatcag 
aagccagggc 
tttgactctg 
tgctggctca 
ccctgagaca 
tccattaaac 
gataagcacc 
gaccagcaca 
ctcaacaaag 
aacaagtcct 



catacggcag 
tctcccaccc 
tctctgaggg 
aaaccctcag 
agcactaaag 
acatctacag 
accttggcca 
tcctctgctc 
ccatcgctga 
gtttcaggat 
gaagcatcta 
acagtcccca 
agatatattc 
aaggaaaatg 
gg aggaagga 
aaagggaccc 
tatcccttct 
cctttgtcat 
ggccagagtc 
cccaactcag 
gtcagaagca 
actatcctta 
aagactttgg 
gcgctgacaa 
tccacgagtg 
tctgctcctg 
actcttgaca 
ctcagaacca 
tctgacatcc 
agcacagctc 
atgaaaacag 
ccctcagtgg 
gaagaaacat 
tttgctgttc 
acaacccacG 
gccacgaacg 
agtcctccag 
tcatctctac 
actagacatc 
agcatfccctc 
ttctcacacc 
acaaagccca 
gaaagagtca 



ctcttctaga 
tgagaaattt 
gattcttctt 
gccttcctgg 
ccacaccaga 
gtgcaatcgt 
gtcctacatc 
tccctgagtc 
gtccccaggt 
tgagttcccc 
catacacact 
ctgagacctc 
ctgtaaaaat 
ccccagtgtc 
caaacccatc 
caaattccag 
cctctcctga 
catctgcttc 
tcacctcccc 
ctatcccttt 
caatttcctc 
ccttccatgc 
cttcagaatg 
ccacatctcc 
gaaaggaaac 
gagaagagtc 
gcaagatcag 
caggcagcac 

tgagggcaac 

agcaatttag 
agagaccccc 
tctctggctt 
ctgcagacac 
ccactgggat 
tactcaccag 
gtgtcccagt 
gagggacaaa 
agtcctcagc 
ccttctcttc 
tgttgtcatc 
acaaagccac 
gctcagccgt 
gaaatgcaac 



cacaggtttt 
tggagtttca 
agacctccat 
gtcatcttct 
aatggattca 
ggtgacagaa 
ttcggttgtg 
aacctctaga 
caatggaact 
aaggaccagg 
cactgtagag 
aacaactgaa 
cacatctcca 
tatgactcca 
atttgggaca 
aggtgaaaca 
acctggctct 
agttctcgat 
tctgtctcct 
ttccatgaca 
tctggggact 
cttcgctgag 
gttgggaagt 
atctaccact 
agaaggaact 
cgaaatgact 
aagtccatct 
ctctgggagg 
cacttccagc 
tgaaccccag 
agcatcaacc 
caccaccctg 
actcatcgga 
ttcaatgaca 
agccacagca 
ctccgtgtct 
gccatcatat 
tttcagggaa 
ccctgaacca 
tgcttcagtt 
ctcatctatt 
tctttcctcc 
ttcccctctg 
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25 81 actcatccat ctccatcagg ggaagagaca gcagggagtg tcctcactct cagcacctct 
2641 gctgagacta cagactcacc taacatccac ccaactggga cactgacttc agaatcgtca 
2701 gagagtccta gcactctcag cctcccaagt gtctctggag tcaaaaccac attttcttca 
2 761 tctactcctt ccactcatct atttactagt ggagaagaaa cagaggaaac ttcgaatcca 
5 2 821 tctgtgtctc aacctgagac ttctgtttcc agagtaagga ccaccttggc cagcacctct 

2881 gtccctaccc cagtattccc caccatggac acctggccta cacgttcagc tcagttctct 

2 941 tcatcccacc tagtgagtga gctcagagct acgagcagta cctcagttac aaactcaact 

3 001 ggttcagctc ttcctaaaat atctcacctc actgggacgg caacaatgtc acagaccaat 
3 061 agagacacgt ttaatgactc tgctgcaccc caaagcacaa cttggccaga gactagtccc 

10 3121 agattcaaga cagggttacc ttcagcaaca accactgttt caacctctgc cacttctctc 

3181 tctgctactg taatggtctc taaattcact tctccagcaa ctagttccat ggaagcaact 
3241 tctatcaggg aaccatcaac aaccatcctc acaacagaga ccacgaatgg cccaggctct 
33 01 atggctgtgg cttctaccaa catcccaatt ggaaagggct acattactga aggaagattg 
33 61 gacacaagcc atctgcccat tggaaccaca gcttcctctg agacatctat ggattttacc 

15 3421 atggccaaag aaagtgtctc aatgtcagta tctccatctc agtccatgga tgctgctggc 

3481 tcaagcactc caggaaggac aagccaattc gttgacacat tttctgatga tgtctatcat 
3 541 ttaacatcca gagaaattac aatacctaga gatggaacaa gctcagctct gactccacaa 
3 601 atgactgcaa ctcaccctcc atctcctgat cctggctctg ctagaagcac ctggcttggc 
3661 atcttgtcct catctccttc ttctcctact cccaaagtca caatgagctc cacattttca 

20 3721 actcagagag tcaccacaag catgataatg gacacagttg aaactagtcg gtggaacatg 

3 781 cccaacttac cttccacgac ttccctgaca ccaagtaata ttccaacaag tggtgccata 
3 841 ggaaaaagca ccctggttcc cttggacact ccatctccag ccacatcatt ggaggcatca 
3901 gaagggggac ttccaaccct cagcacctac cctgaatcaa caaacacacc cagcatccac 

3 961 ctcggagcac acgctagttc agaaagtcca agcaccatca aacttaccat ggcttcagta 
25 4021 gtaaaacctg gctcttacac acctctcacc ttcccctcaa tagagaccca cattcatgta 

4 081 tcaacagcca gaatggctta ctcttctggg tcttcacctg agatgacagc tcctggagag 
4141 actaacactg gtagtacctg ggaccccacc acctacatca ccactacgga tcctaaggat 

42 01 acaagttcag ctcaggtctc tacaccccac tcagtgagga cactcagaac cacagaaaac 
4261 catccaaaga cagagtccgc caccccagct gcttactctg gaagtcctaa aatctcaagt 

30 4321 tcacccaatc tcaccagtcc ggccacaaaa gcatggacca tcacagacac aactgaacac 

43 81 tccactcaat tacattacac aaaattggca gaaaaatcat ctggatttga gacacagtca 
4441 gctccaggac ctgtctctgt agtaatccct acctccccta ccattggaag cagcacattg 
4501 gaactaactt ctgatgtccc aggggaaccc ctggtccttg ctcccagtga gcagaccaca 
45 61 atcactctcc ccatggcaac atggctgagt accagtttga cagaggaaat ggcttcaaca 

35 4621 gaccttgata tttcaagtcc aagttcaccc atgagtacat ttgctatttt tccacctatg 

4681 tccacacctt ctcatgaact ttcaaagtca gaggcagata ccagtgccat tagaaataca 
4 741 gattcaacaa cgttggatca gcacctagga atcaggagtt tgggcagaac tggggactta 
4 8 01 acaactgttc ctatcacccc actgacaacc acgtggacca gtgtgattga acactcaaca 
4861 caagcacagg acaccctttc tgcaacgatg agtcctactc acgtgacaca gtcactcaaa 

40 4 921 gatcaaacat ctataccagc ctcagcatcc ccttcccatc ttactgaagt ctaccctgag 

4 981 ctcgggacac aagggagaag ctcctctgag gcaaccactt tttggaaacc atctacagac 

5 041 acactgtcca gagagattga gactggccca acaaacattc aatccactcc acccatggac 
5101 aacacaacaa cagggagcag tagtagtgga gtcaccctgg gcatagccca ccttcccata 
5161 ggaacatcct ccccagctga gacatccaca aacatggcac tggaaagaag aagttctaca 

45 5221 gccactgtct ctatggctgg gacaatggga ctccttgtta ctagtgctcc aggaagaagc 

5281 atcagccagt cattaggaag agtttcctct gtcctttctg agtcaactac tgaaggagtc 
5341 acagattcta gtaagggaag cagcccaagg ctgaacacac agggaaatac agctctctcc 
54 01 tcctctcttg aacccagcta tgctgaagga agccagatga gcacaagcat ccctctaacc 
5461 tcatctccta caactcctga tgtggaattc atagggggca gcacattttg gaccaaggag 

50 5 521 gtcaccacag ttatgacctc agacatctcc aagtcttcag caaggacaga gtccagctca 

5581 gctaccctta tgtccacagc tttgggaagc actgaaaata caggaaaaga aaaactcaga 
5641 actgcctcta tggatcttcc atctccaact ccatcaatgg aggtgacacc atggatttct 

57 01 ctcactctca gtaatgcccc caataccaca gattcacttg acctcagcca tggggtgcac 
5 7 61 accagctctg cagggacttt ggccactgac aggtcattga atactggtgt cactagagcc 

55 5821 tccagattgg aaaacggctc tgatacctct tctaagtccc tgtctatggg aaacagcact 

58 81 cacacttcca tgactgacac agagaagagt gaagtgtctt cttcaatcca tccccgacct 
5941 gagacctcag ctcctggagc agagaccact ttgacttcca ctcctggaaa cagggccata 
6001 agcttaacat tgcctttttc atccattcca gtggaagaag tcatttctac aggcataacc 
6061 tcaggaccag acatcaactc agcacccatg acacattctc ccatcacccc accaacaatt 

60 6121 gtatggacca gtacaggcac aattgaacag tccactcaac cactacatgc agtttcttca 

6181 gaaaaagttt ctgtgcagac acagtcaact ccatatgtca actctgtggc agtgtctgct 
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6241 

6301 

6361 

6421 

6481 

6541 

6601 

6661 

6721 

6781 

6841 

6901 

6961 

7021 

7081 

7141 

7201 

7261 

7321 

7381 

7441 

7501 

7561 

7621 

7681 

7741 

7801 

7861 

7921 

7981 

8041 

8101 

8161 

8221 

8281 

8341 

8401 

8461 

8521 

8581 

8641 

8701 

8761 

8821 

8881 

8941 

9001 

9061 

9121 

9181 

9241 

9301 

9361 

9421 

9481 

9541 

9601 

9661 

9721 

9781 

9841 



tcccctaccc 
gcctcacttg 

tgggacctca 

ctgtcctcag 
ttttcagctg 
ccccagaata 
gagactcctg 
agatcagatg 
tcttcagcag 
tcctttagaa 
actaatacag 
gcatcagatg 
tccccctccc 
tcatttagaa 

ggggattgga 
ctgggagaga 

gctagtttca 
gatatccttg 
ctgatcaaca 
cccactagca 
ccccaggttc 
aacacctatt 
tttacaatca 
agaactttca 
aattctgtgg 
gacttacctg 
gcatcaacta 
ggatccagtg 
tcttcaccac 
cctgagagcc 
gtcacagata 
gggactctgt 
actgccactg 
atagcttcgg 
aatggtgatg 
actatccaag 
gttgctccac 
gcgggaacta 
ttggtagact 
cagggtatgg 
acattttctt 
accacgaaca 
accagtgctg 
tttgcctcac 
tttgagtttt 
cagtccctga 
tctccagaaa 
tcatttcaat 
gggcatgaac 
atgtcttatc 
aaactcacaa 
ataagcctat 
aagacttcac 
cagacttcag 
cctgagttaa 
acgttaacct 
gaacccacag 
gccaagacct 
agccaggcag 
ccagcaggca 
aaggaaccca 



atgagaattc 

aatccttgga 

ctacatctct 

gccattctgg 

catccacatc 

cagccgcgag 

tgggggcaag 

tttctggcct 

ggaccaaatt 

tgaacaagga 

aaacaagcat 

taattgacac 

ctgatactga 

ccatttcatc 

tgagttcagc 

gaaggacaat 

cagagaccag 

acgcacggaa 

cctccatagc 

tctcaggaat 

ccacatctac 

ctcttgatgg 

cccaccctgt 

gcaccatggt 

tgacttctgt 

ccatgggctt 

ttgaaccagc 

ctacatcaga 

agaccccaac 

ttttggtagt 

ccatcttgtt 

ctggagcttc 

agccaacaag 

atagtcttgg 

cactggttct 

gagtaacaga 

ggaatacaac 

ctggttccct 

catcagctgg 

ctagctctgg 

ctctccctct 

gactgacagc 

agtcaggcct 

tgactacagc 

ctgaggtccc 

acaccattcc 

aaaacccaag 

caacaggttt 

ccagagtccc 

cagacccaag 

ctctcttcag 

ccactgagaa 

tatttcttgg 

ctctgacact 

cctcaagcca 

tcacaccatc 

ccagaagaaa 

ccttggttga 

cacaaggaaa 

catccccagg 

gcatcagccc 



agtctcttct 

ttccacaatc 

ccccactaca 

ggtttcaaac 

tgctgctaag 

tactttgaac 

tatcagctct 

tacatctgag 

aactaggaca 

tccatggaca 

cccagtaaac 

accttcagat 

agtgacaact 

tctcactcat 

tatgtctaca 

cacctctgct 

cacagtttca 

aacaaatgag 

ttcttcaact 

gacagcaagt 

aacagagaca 

gggctccaat 

cgagacaagc 

cagcactgac 

tccagcacca 

tctcaagaca 

cactgccttc 

agccagtctt 

tacacccacc 

cactgagact 

ttcaactgtg 

cttccctact 

ttcattagct 

cacagtccca 

taagacagta 

aagtccactc 

ctatgaaggt 

tgtattcagt 

gcttgagagg 

aggaatcaga 

gaccatgaac 

tactcaatca 

cctaacacct 

tcccccatca 

aagtttggat 

agactcagat 

ggcaaggatg 

tactgaaacc 

cacttcagga 

caaggcatca 

cacaggtcaa 

agaaacaagc 

gccttccatg 

ttctccaaca 

gaccattgca 

tgagacccca 

gagttctcca 

aacaactgat 

ttccacgtgg 

aagcccagaa 

agagatcagg 



ggaagcagca 
agtaggagga 
acttggccaa 
ccaagttcaa 
caaagaaatc 
actgatgcat 
gaagtccctc 
agtactgcta 
atatccctgc 
gtgtcaatcc 
agcgcaggtc 

ggggctgaga 

atctcacatt 
gagttgactt 
aagcccacag 
gctccaacca 
ctggataatg 
ctcccctcag 
atggatgtca 
tcctccccat 
aatacagcca 
gtgggtggca 
tcggccctat 
actgcctccg 
ggtacatggg 
agtcctgcag 
actccccatc 
ctcactacga 
tctggagcaa 
tcagacacaa 
tccacgccac 
ttactcccgg 
acatcctttg 
gagactaccc 
agtaacccag 
catccttctt 
tcgatcacag 
cagagttctg 
gcatctgtga 

a g fc gggtcca 

ccaggtgagg 
acagcaccca 
gtctctgcct 
acttggggga 
actaagtccg 
gcaagcacag 
atgacttcca 
cctgagggat 
acaggggacc 
tcagctatga 
gcagcaaggt 
ttcctttccc 
gcaaggcagc 
tccactctaa 
gaagaagagg 
acatccttgt 
gaaacatggg 
ggaacgctag 
cctgccccag 
gtgtctacca 
tccactgtgc 



catcctctcc 

atgcaatcac 

gtactagttt 

ctacgactga 

cagaaacaga 

cctcggtcac 

ttccaatggc 

acccgagttt 

ccacttcaga 

ctttggggtc 

cacctggctt 

gtattcccac 

tcccagaaaa 

caagagtgac 

gagccagtcc 

cttcccccat 

aaactacagt 

atagcagttc 

ctaaaacagc 

ctctcttctc 

cctctccatc 

ctccatccac 

tagcctggtc 

gagaaaatcc 

ccagtgtagg 

gagaggcaca 

tctcagcagc 

gtgaaagcaa 

actgggaaac 

cacttacctc 

cttctaaatt 

acactccagc 

attccacccc 

tgaccatgtc 

ataggagcat 

ccacttcccc 

tggcactttc 

aaaactcaga 

tgccactaac 

ctcactcaac 

ttacagccat 

aagggatacc 

cctcaagccc 

tcccacagtc 

cttctttacc 

catcttcctc 

caaaggccat 

ctgcctcccc 

ctagatatgc 

catcgacctc 

ctggttctag 

ccactgcatc 

ccaacatatt 

atatgtccca 

gaacaacagc 

tacctgtctc 

caagctctat 

tgaccaccat 

cagaggagac 

ctctcaaaat 

gaaattctcc 



atattcctca 

ttcctggcta 

atctgaggca 

atttccactc 

gacccatggt 

aggtctttct 

cataacttct 

aggcacagcc 

gtctttggtt 

ccatccaact 

gtccacagta 

tgtctccttt 

gacaactcat 

acctattcct 

ctccattaca 

agttctcact 

aaaaacctca 

ttcttctgat 

ctccatcagt 

ttcagataga 

tgtttccagt 

tttaccaccc 

tagaccagta 

tacctctagc 

cagtactact 

ctcacttcta 

agtggtcact 

agccattcat 

ttcagctact 

aaagattttg 

tccaagtacg 

catccctctc 

actggtgact 

agagacctca 

ccctggaatc 

ctctaagatt 

tactttgcct 

gacaacggct 

cacaggaagc 

tggaaccaaa 

gtctgaaatc 

tgtgaagccc 

atcaaaggcc 

taccttgaca 

aactcctgga 

actgtccaag 

aagtgcaagc 

ttctatggca 

ctcagagagc 

tcttgcatca 

ttcctctccc 

cacctccaga 

ggtgcatctt 

ggaggagcct 

tgaaacacag 

ttctcccaca 

ttcagttcct 

aaagatgtca 

ggggaccagt 

catgagctcc 

ttggaagact 



WO 2004/075713 



10/47 



PCT/CA2004/000281 



30 



9901 ccagaaacaa ctgttcccat ggagaccaca gtggaaccag tcacccttca gtccacagcc 
9961 ctaggaagtg gcagcaccag catctctcac ctgcccacag gaaccacatc accaaccaag 
10021 tcaccaacag aaaatatgtt ggctacagaa agggtctccc tctccccatc cccacctgag 
10081 gcttggacca acctttattc tggaactcca ggagggacca ggcagtcact ggccacaatg 
5 10141 tcctctgtct ccctagagtc accaactgct agaagcatca cagggactgg tcagcaaagc 

102 01 agtccagaac tggtttcaaa gacaactgga atggaattct ctatgtggca tggctctact 
102 61 ggagggacca caggggacac acatgtctct ctgagcacat cttccaatat ccttgaagac 
10321 cctgtaacca gcccaaactc tgtgagctca ttgacagata aatccaaaca taaaaccgag 
10381 acatgggtaa gcaccacagc cattccctcc actgtcctga ataataagat aatggcagct 
10 10441 gaacaacaga caagtcgatc tgtggatgag gcttattcat caactagttc ttggtcagat 

10501 cagacatctg ggagtgacat cacccttggt gcatctcctg atgtcacaaa cacattatac 
10561 atcacctcca cagcacaaac cacctcacta gtgtctctgc cctctggaga ccaaggcatt 
10621 acaagcctca ccaatccctc aggaggaaaa acaagctctg cgtcatctgt cacatctcct 
10 681 tcaatagggc ttgagactct gagggccaat gtaagtgcag tgaaaagtga cattgcccct 
15 10741 actgctgggc atctatctca gacttcatct cctgcggaag tgagcatcct ggacgtaacc 

10801 acagctccta ctccaggtat ctccaccacc atcaccacca tgggaaccaa ctcaatctca 
10 8 61 actaccacac ccaacccaga agtgggtatg agtaccatgg acagcacccc ggccacagag 
10921 aggcgcacaa cttctacaga acacccttcc acctggtctt ccacagctgc atcagattcc 
10981 tggactgtca cagacatgac ttcaaacttg aaagttgcaa gatctcctgg aacaatttcc 
20 11041 acaatgcata caacttcatt cttagcctca agcactgaat tagactccat gtctactccc 

11101 catggccgta taactgtcat tggaaccagc ctggtcactc catcctctga tgcttcagct 
11161 gtaaagacag agaccagtac aagtgaaaga acattgagtc cttcagacac aactgcatct 
11221 actcccatct caactttttc tcgtgtccag aggatgagca tctcagttcc tgacatttta 
11281 agtacaagtt ggactcccag tagtacagaa gcagaagatg tgcctgtttc aatggtttct 
25 11341 acagatcatg ctagtacaaa gactgaccca aatacgcccc tgtccacttt tctgtttgat 

114 01 tctctgtcca ctcttgactg ggacactggg agatctctgt catcagccac agccactacc 
11461 tcagctcctc agggggccac aactccccag gaactcactt tggaaaccat gatcagccca 
11521 gctacctcac agttgccctt ctctataggg cacattacaa gtgcagtcac accagctgca 
11581 atggcaagga gctctggagt tactttttca agaccagatc ccacaagcaa aaaggcagag 
11641 cagacttcca ctcagcttcc caccaccact tctgcacatc cagggcaggt gcccagatca 
11701 gcagcaacaa ctctggatgt gatcccacac acagcaaaaa ctccagatgc aacttttcag 
11761 agacaagggc agacagctct tacaacagag gcaagagcta catctgactc ctggaatgag 
11821 aaagaaaaat caaccccaag tgcaccttgg atcactgaga tgatgaattc tgtctcagaa 
11881 gataccatca aggaggttac cagctcctcc agtgtattaa aggaccctga atacgctgga 
35 11941 cataaacttg gaatctggga cgacttcatc cccaagtttg gaaaagcagc ccatatgaga 

12001 gagttgcccc ttctgagtcc accacaggac aaagaggcaa ttcacccttc tacaaacaca 

12 061 gtagagacca caggctgggt cacaagttcc gaacatgctt ctcattccac tatcccagcc 
12121 cactcagcgt catccaaact cacatctcca gtggttacaa cctccaccag ggaacaagca 
12181 atagtttcta tgtcaacaac cacatggcca gagtctacaa gggctagaac agagcctaat 

40 12241 tccttcttga ctattgaact gagggacgtc agcccttaca tggacaccag ctcaaccaca 

12301 caaacaagta ttatctcttc cccaggttcc actgcgatca ccaaggggcc tagaacagaa 
123 61 attacctcct ctaagagaat atccagctca ttccttgccc agtctatgag gtcgtcagac 
12421 agcccctcag aagccatcac caggctgtct aactttcctg ccatgacaga atctggagga 
12481 atgatccttg ctatgcaaac aagtccacct ggcgctacat cactaagtgc acctactttg 

45 12541 gatacatcag ccacagcctc ctggacaggg actccactgg ctacgactca gagatttaca 

12601 tactcagaga agaccactct ctttagcaaa ggtcctgagg atacatcaca gccaagccct 
12661 ccctctgtgg aagaaaccag ctcttcctct tccctggtac ctatccatgc tacaacctcg 
12721 ccttccaata ttttgttgac atcacaaggg cacagtccct cctctactcc acctgtgacc 
12781 tcagttttct tgtctgagac ctctggcctg gggaagacca cagacatgtc gaggataagc 

50 12841 ttggaacctg gcacaagttt acctcccaat ttgagcagta cagcaggtga ggcgttatcc 

12901 acttatgaag cctccagaga tacaaaggca attcatcatt ctgcagacac agcagtgacg 
12961 aatatggagg caaccagttc tgaatattct cctatcccag gccatacaaa gccatccaaa 

13 021 gccacatctc cattggttac ctcccacatc atgggggaca tcacttcttc cacatcagta 
13 081 tttggctcct ccgagaccac agagattgag acagtgtcct ctgtgaacca gggacttcag 

55 13141 gagagaagca catcccaggt ggccagctct gctacagaga caagcactgt cattacccat 

13201 gtgtctagtg gtgatgctac tactcatgtc accaagacac aagccacttt ctctagcgga 
13261 acatccatct caagccctca tcagtttata acttctacca acacatttac agatgtgagc 
13321 accaacccct ccacctctct gataatgaca gaatcttcag gagtgaccat caccacccaa 
13 3 81 acaggtccta ctggagctgc aacacagggt ccatatctct tggacacatc aaccatgcct 
13441 tacttgacag agactccatt agctgtgact ccagatttta tgcaatcaga gaagaccact 
13501 ctcataagca aaggtcccaa ggatgtgacc tggacaagcc ctccctctgt ggcagaaacc 
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agctatccct 
ttacaagggc 
gtgaagacca 
ttgaacaatc 
attcatcctt 
ctgcattcca 
gcctccagca 
attgagggag 
gagatgaact 
actgaagcca 
cagtcaacaa 
cccatgacaa 
aagattccac 
gtgactgagg 
gtgtcacaga 
gtccttgtca 
tctcctgttt 
gatacaagct 
atatcggtca 
gtagttacca 
tacccagagc 
acaatttccc 
tcctccctga 
gagacaagca 
gatgtcactt 
gacatctcca 
gaaataacca 
atggacccat 
tcacaattgg 
cctccctctg 
acaccttccc 
cttctcacct 
accagctcac 
agtaaagaca 
aacaactctg 
acaactcaaa 
gtgcatggtt 
agactgagag 
ctttccaaag 
agcaaaattt 
gagatcccca 
acccaagcaa 
acactttccc 
accactctca 
gaaaccagct 
tcctccacat 
tctgttctgg 
tcacctccaa 
cacacagaag 
gggtctggac 
cctctgatga 
atctctcaga 
gagagcagca 
gtgcccacag 
atctcagacc 
ctcttcacct 
actcctgggg 
ggagggactc 
agcagaaccc 
gggttttccc 
gagagcatcc 



cttccctgac 
aacatacatc 
cagatatgtt 
catcaaatga 
ccataaacaa 
ctctcccagt 
tgggggacgc 

agccaacatc 
caactacaga 
caaaaatgga 
agttcccaga 
tatctaccca 
ttgccttaga 
ggtttgccca 
caaaccctcc 
caaccttacc 
ctatgtcctc 
tagaaacagt 
cttcagcagc 
atgtggggac 
catctaaagt 
gatcaatacc 
ctcctaaact 
ctgttcctta 
cctctagcag 
cagaaaccaa 
tcaccaccca 
caaatacaac 
atgtgaccac 
tggataaaac 
tgatttcttc 
ctggcctagt 
ttccaaattt 
caaaggaaat 
gacatgaatc 
tggttatcac 
cctctgagac 
agaccagtac 
tccccactgg 
ccaccccaga 
gggtcttcac 
gtcctcctga 
agggagggac 
tgggcatggg 
ctgtgtcttc 
caccacagag 
tgacaaccac 
atttgagcag 
ccgccatgca 
ataaatcaca 
gtaccacctc 
ctaaccaaat 
cgtctgagaa 
gtgctattac 
ttgatcggcc 
cccccatcat 
ctacatcaca 
attcaaccgt 
ctggagatgt 
tgatgtcacc 
cctcctctcc 



acctttcttg 
ctctcctgtt 
gaacacaagc 
gatactggcc 
agcagtgacc 
cagctcagaa 
tcttgcttct 
ctccctgact 
gtcaaacatc 
agtcccctct 
tattttctca 
catgaccacc 
cacatcaacc 
ctcaaaaata 
ctttcaggat 
ttcttctgtt 
agttcttact 
gaccagttca 
caccacagat 
caccggttca 
cacatctcca 
taaatcctct 
gagggagacc 
caaagagctc 
tacatccttc 
caccaggctg 
aacaggtcct 
cccccaggca 
tcttatgagc 
cagctccccc 
taccttacca 
gaagattaca 
cagcagcacc 
ttttccttct 
ccattcccct 
caccactgtg 
tacaaacatt 
ctctcaggag 
tactattact 
ccatgataag 
ctcctctatt 
gtctgcatcg 
tcattcaact 
tcctgggaat 
cctgatgtct 
catcccctcc 
agatgtgttg 
catcactcat 
tcattccaca 
atcctctgtc 
caccctgggg 
tcaaacagag 
gaccagctca 
tcaggcctcc 
cacaatagca 
gacaaaatct 
gggtatcctt 
gtctcaggga 
gtcatggatg 
ttccatgaca 
tctccctgtg 



gtcacaacca 
tctgcgactt 
atggaacctg 
actttggcag 
aatatgggga 
ccatctacag 
atatcaatac 
gctggacgaa 
atcctctcca 
tttgatgcaa 
gtagccagca 
acccagacag 
ttggaaacct 
accactgcaa 
gaagccagct 
gctttcacac 
tcttcactgg 
cctcaaagta 
atagagacaa 
gcatttgaat 
aatgttacca 
aagactacaa 
agcatctccc 
actggtgcca 
cctggccctg 
tctacctccc 
catggggcta 
gggatccact 
agaattccac 
tcttcctttc 
gaggataagc 
gacatattac 
tcagataaga 
ataaacacag 
gcactggctg 
ggagatccag 
aagagagagc 
tccagctttc 
gaggtctcca 
tccacagtgc 
aagacaaaat 
cacagtaccc 
gtgactcagg 
gtgtcatgga 
tcacctgcca 
tctcctcttc 
ggcacaacaa 
gagagaccgg 
aacaccgcag 
ctagctgact 
gacacaagtg 
ccaacagcat 
acaacagaga 
agaacagaaa 
cccgacatct 
gcagaaatga 
ccttgggaca 
ttcccacact 
acaactcccc 
tccccttctc 
actgcacttc 



tacctcctgc 
cagttcttac 
tgaccaattc 
ccaccacaga 
ctgccagttc 
ccacatctcc 
ctggttctga 
aagagaacag 
atgtgtctgt 
cattcatacc 
gtagactttc 
ggtcttctgg 
cagcagggac 
tgaacaatga 
ctccctcttc 
cgcaatggca 
taaagaccgc 
tgagcaacac 
cgcatccttc 
cacattctac 
cctccaccat 
gaactgagac 
aggagatcac 
ctaccgaggt 
atcagtccac 
caataatgac 
catcacagga 
cagctatgac 
aggatgtatc 
tgtcctcacc 
tctcctctcc 
gtacacgctt 
tactggccac 
aagagaccaa 
actcagagac 
ctccttccac 
caacatattt 
ccacggacac 
gtacaggggt 
cacctgacac 
ctgcagaaat 
ttcccttgga 
gattcccata 
tgacaactcc 
tgacatcccc 
ctgtgactgc 
gcccagagtc 
ccacttacaa 
tgaccaatgt 
cagagacatc 
tttccacatc 
ccctgagccc 
caaatactgc 
tctcctctag 
ccacaggaat 
ccgtcaccac 
catcaaccac 
cagagataac 
ctgtggaaga 
ctgtttcctc 
ttacttctgt 



cacttccacg 
ctctggactg 
acctcaaaat 
tatagagact 
agcacatgta 
aatggttcct 
gaccacagac 
caccctccag 

gggggctatt 

aactcctgct 
aaactctcct 
agctacatca 
tccatcagtg 
tgtcaaggac 
tcaagcacct 
cagtacctcc 

aggcaaggtg 

tttggatgac 
cataaacaca 
tgtctcagct 
ggaagacacc 
tgagacaact 
ctcgtccaca 
atccaggaca 
agtgtcacta 
agaatctgca 
tacttttacc 
tcatggattt 
atggacaagt 
tgcaatgacc 
tatgacttca 
ggaacctgtg 
ttctaaagac 
tgtgaaagcc 
acccaaagcc 
atcaatgcca 
cttgactcct 
aagttttcta 
caactcttct 
cttcacagga 
gacgatcacc 
cacatcaacc 
ctcagaggtg 
ccctgtggaa 
ttctcctgtt 
acttcctact 
tgtaaccagt 
agacactgca 

agggacttcc 

gaaagccaca 
aactcctaat 
tagactgagg 
cttttcttat 
cagaacatcc 
gatcaccagg 
tcaaacaact 
acttttccag 
cactcttcgg 
aaccagctct 
cacatcacca 
tctggtgaca 
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17461 
17521 
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17641 
17701 
17761 
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18181 
18241 
18301 
18361 
18421 
18481 
18541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 
19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 



accaccaatg 
agcagcccca 
catgcttcca 
gaatcacaat 
atcaccttcg 
agaattcaga 
gaggagatca 
actactgagg 
gatcattcca 
ccttttgtaa 
atctcacagg 
tcacctgtga 
aagggcgtgt 
gtgcctttac 
cccacttctg 
atgttggaca 
ggtgagatac 
acagcagaaa 
atccactccc 
gctattgttt 
acatcccctc 
acagagtcaa 
gcagaagtca 
ccagacattt 
acacacaaga 
accctggaca 
tttgtagatt 
agccctttct 
acctcacctt 
gtgacctcag 
cctgtgactc 
gatacacaca 
tcaccaaatg 
gcagtagtta 
tctgcgatga 
actagcacct 
ttcactgctg 
atccaaggca 
ctttctactt 
cctcataggg 
gcagggaccc 
agtagagttc 
tcctcttccc 
ccagaaagta 
aagaccacag 
cataagatac 
aacatagcag 
ccagcctact 
gacaccattg 
tcaacattct 
tccacagaga 
acagaagttg 
ccagacatct 
tcaaatatga 
tttaccttgg 
gaatttccac 
acaatccctc 
atgacttcac 
atgacctcac 
gaacctacaa 
gatgaagaca 



tattgggcac 
cacaggagag 
tgcatacaaa 
cttctgtccc 
ccatggggga 
ctgaatcaac 
acactgtgac 
tctccaggac 
aaatgtcacc 
caggatccac 
ctacccttac 
ctcagagatt 
catggcaaag 
ctgcaataac 
ctctccctgt 
cacactcaga 
tcacttctga 
ccaatatggg 
agccatccgg 
ccacatcaac 
tgactcctga 
acactgtttt 
cctactatga 
cacctgaagc 
ccatcgccac 
catcaaccat 
cagaaacaac 
ctgcagaaga 
ctcctgtaac 
tacccacccc 
gttcacctca 
caatgcatcc 
aattctattt 
tcacttccac 
aaaagattga 
cccagaaaat 
ctactactga 
ctgaaaagcc 
ttgctggcct 
cgacatcaca 
actcagctat 
ctgagtacat 
ttctgtcttt 
ggccgtcttc 
atatgctggc 
cgactacttc 
taaccaatgt 
cagaaccacc 
tttccacatc 
ccgtggctca 
aaagtgctgt 
cctcttctag 
ccactgaagt 
ccatcatcac 
acacaccaac 
actcagaaat 
cttctataga 
ctcctgtttc 
tgctcacccc 
ccagttcacc 
ccacagctat 



aacaagccca 
actgaccact 
cactgcagtg 
agctgattca 
tacaagtgtt 
atcctctttg 
agagaccagc 
agaagttatc 
ctacatctcc 
agaaatggcc 
cctggacaca 
tccacactca 
ccctccctct 
ctcacattca 
gacttccctt 
acttgtgacc 
agcctccaca 
gaccaccaat 
acacacacct 
acctggttct 
actgaaagag 
ctccagtgtg 
tcctacattc 
cagcagcagt 
acaaacaggt 
agccacctca 
cagtgtcatg 
agccaactct 
ttccacattg 
tacactggcg 
aaatttaagg 
ttctataaac 
tactgtctca 
ctcgggggac 
gtctgagaca 
tggctcatcc 
ggtctccaga 
cacaatgtca 
gacaaaatcc 
gggtaccctt 
gactcatgga 
atcagggaca 
accagcaata 
tcctgttcat 
atctgtggcc 
agaagacatt 
ggggaccacc 
caaagtcacc 
catgcctggc 
tgggctgaag 
ccttcacaag 
aagaacatcc 
gatccccagc 
tcgaacaggt 
tacatcctcc 
gaccactgtc 
gaaaaccagc 
ctcaacatta 
tagcctagtg 
tccaaatttg 
agaagccatg 



gagaccgtaa 
tacaaagaca 
gccaacgtcg 
cacacatcca 
tctacatcaa 
attcctggat 
actgtccttt 
acttccagca 
acagaaacca 
atcaccaacc 
tcaagcacag 
gaggagacca 
gtggaagaaa 
tctctttatt 
ctcacctctg 
agctccttac 
aatacagaga 
tctatgcata 
ccaaaggtta 
cctgagacta 
gacagcaccg 
tccctggatg 
atgccagctt 
cattctaact 
ccttctgggg 
gcaggaactc 
aacaatgatc 
ctctcttctc 
caagagcaca 
aagatcacag 
aacaccttgg 
acagcaatgg 
cctgactcag 
tcaatagttt 
actttctccc 
tcagacacaa 
acagaactca 
ccggacacct 
gaagaaagga 
acctgggaca 
ttttcacaat 
agcccaccct 
acctcaccgt 
ctgacttcac 
agtttacctc 
aaagatacag 
acttctgaaa 
tctccaatgg 
tcctctgaga 
ggaaccagca 
ttgaccactg 
attccaggcc 
ctgcctatct 
cctcctcttg 
agggcaggaa 
atgaacaagg 
ttctcctctt 
ccaaagacca 
atgaccacag 
agcagtacct 
catccttcca 



cgagttcacc 

ctgcgcacac 

ggacctccat 

aagccacatc 

ctcctgcctt 

taagggacac 

cagaagtgcc 

gaacaaccat 

tcaccaggct 

aaacaggtcc 

cttcctggga 

ctactatgag 

ccagttctcc 

ccgcagtatc 

gcaggaggaa 

caagtgcaag 

caattcactt 

aactacattc 

ctggatctat 

aaaatgttga 

ccctggtgat 

ctgctactga 

ctgctcagtc 

ctcctccctt 

tgacatctct 

catcagccag 

tcaatgatgt 

aggcacctct 

gtacctcctc 

acatggacac 

ccacttcaga 

ccaatgtggg 

acccatataa 

ccacatcaat 

tgatatttag 

gcacggtctt 

cctcctctag 

ccacaagatc 

ccattgccac 

catcaatcac 

tagatttgtc 

ctgtggaaaa 

cccctgtacc 

tccccacctc 

caaacttggg 

agaaaatgta 

aggaatctta 

ttacctcttt 

ttacaaggat 

cctcccagga 

gtgctactga 

ctgatcattc 

cccttggcat 

gctctacatc 

cacactcgat 

accctgagat 

ccctgatgcc 

ttcacaccac 

acacattggg 

cacatgtgat 

caagcacagc 



tccaaattta 
agaagccatg 
ttctggacat 
tccaatgggt 
ctttgagact 
caggacgtct 
cactactact 
ctcagggcct 
ctccactttt 
tatagggact 
agggactcac 
cagaagtact 
ttcttcccca 
aggaagtagc 
gaccatagac 
tagcttctca 
ttcagagaac 
ctctgtctca 
gatggaggac 
cagagactca 
gaactcaact 
ggtctccagg 
aacaaagtcc 
gacaatatct 
tggccaactg 
aactcaggat 
gttgaagaca 
ccttgtgaca 
tcttgtttct 
aaacttagaa 
agccaccaca 
gaccaccagt 
agccacatcc 
gcctagatcc 
actgagggag 
tgacaaagca 
cagaacatcc 
tgtcaccatg 
ccaaacaggt 
aacctcacag 
cactcttacg 
aaccagctct 
tactacatta 
tggcctagtg 
cagcacctca 
tccttccaca 
ttcgtctgtc 
caacataagg 
tgagatggag 
ccccatcgta 
gacctctagg 
cacagagtca 
tacagaatct 
acagggcaca 
ggcgactcag 
tctatcatgg 
ttcaccagcc 
tccttctcct 
cacaagccca 
actgacaaca 
agcgactaat 
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2 0881 gtggaaacca cctgttctgg acatgggtca caatcctctg tcctaactga ctcagaaaaa 
2 0941 accaaggcca cagctccaat ggataccacc tccaccatgg ggcatacaac tgtttccaca 
21001 tcaatgtctg tttcctctga gactacaaaa attaagagag agtcaacata ttccttgact 
21061 cctggactga gagagaccag catttcccaa aatgccagct tttccactga cacaagtatt 
5 21121 gttctttcag aagtccccac tggtactact gctgaggtct ccaggacaga agtcacctcc 

21181 tctggtagaa catccatccc tggcccttct cagtccacag ttttgccaga aatatccaca 
21241 agaacaatga caaggctctt tgcctcgccc accatgacag aatcagcaga aatgaccatc 
21301 cccactcaaa caggtccttc tgggtctacc tcacaggata cccttacctt ggacacatcc 
21361 accacaaagt cccaggcaaa gactcattca actttgactc agagatttcc acactcagag 

10 21421 atgaccactc tcatgagcag aggtcctgga gatatgtcat ggcaaagctc tccctctctg 

21481 gaaaatccca gctctctccc ttccctgctg tctttacctg ccacaacctc acctcctccc 
21541 atttcctcca cattaccagt gactatctcc tcctctcctc ttcctgtgac ttcacttctc 
21601 acctctagcc cggtaacgac cacagacatg ttacacacaa gcccagaact tgtaaccagt 
21661 tcacctccaa agctgagcca cacttcagat gagagactga ccactggcaa ggacaccaca 

15 21721 aatacagaag ctgtgcatcc ttccacaaac acagcagcgt ccaatgtgga gattcccagc 

21781 tttggacatg aatccccttc ctctgcctta gctgactcag agacatccaa agccacatca 
21841 ccaatgttta ttacctccac ccaggaggat acaactgttg ccatatcaac ccctcacttc 
21901 ttggagacta gcagaattca gaaagagtca atttcctccc tgagccctaa attgagggag 
21961 acaggcagtt ctgtggagac aagctcagcc atagagacaa gtgctgtcct ttctgaagtg 

20 22 021 tccattggtg ctactactga gatctccagg acagaagtca cctcctctag cagaacatcc 

22 081 atctctggtt ctgctgagtc cacaatgttg ccagaaatat ccaccacaag aaaaatcatt 
22141 aagttcccta cttcccccat cctggcagaa tcatcagaaa tgaccatcaa gacccaaaca 
222 01 agtcctcctg ggtctacatc agagagtacc tttacattag acacatcaac cactccctcc 
2 2261 ttggtaataa cccattcgac tatgactcag agattgccac actcagagat aaccactctt 

25 22321 gtgagtagag gtgctgggga tgtgccacgg cccagctctc tccctgtgga agaaacaagc 

22381 cctccatctt cccagctgtc tttatctgcc atgatctcac cttctcctgt ttcttccaca 
22441 ttaccagcaa gtagccactc ctcttctgct tctgtgactt cacctctcac accaggccaa 
225 01 gtgaagacta ctgaggtgtt ggacgcaagt gcagaacctg aaaccagttc acctccaagt 
22561 ttgagcagca cctcagttga aatactggcc acctctgaag tcaccacaga tacggagaaa 

30 22621 attcatcctt tcccaaacac ggcagtaacc aaagttggaa cttccagttc tggacatgaa 

22 681 tccccttcct ctgtcctacc tgactcagag acaaccaaag ccacatcggc aatgggtacc 
22741 atctccatta tgggggatac aagtgtttct acattaactc ctgccttatc taacactagg 
22801 aaaattcagt cagagccagc ttcctcactg accaccagat tgagggagac cagcacctct 

22 8 61 gaagagacca gcttagccac agaagcaaac actgttcttt ctaaagtgtc cactggtgct 
35 22921 actactgagg tctccaggac agaagccatc tcctttagca gaacatccat gtcaggccct 

2 2981 gagcagtcca caatgtcaca agacatctcc ataggaacca tccccaggat ttctgcctcc 
2 3 041 tctgtcctga cagaatctgc aaaaatgacc atcacaaccc aaacaggtcc ttcggagtct 
23101 acactagaaa gtacccttaa tttgaacaca gcaaccacac cctcttgggt ggaaacccac 
23161 tctatagtaa ttcagggatt tccacaccca gagatgacca cttccatggg cagaggtcct 
23221 ggaggtgtgt catggcctag ccctcccttt gtgaaagaaa ccagccctcc atcctccccg 
23281 ctgtctttac ctgccgtgac ctcacctcat cctgtttcca ccacattcct agcacatatc 
23341 cccccctctc cccttcctgt gacttcactt ctcacctctg gcccggcgac aaccacagat 
23401 atcttgggta caagcacaga acctggaacc agttcatctt caagtttgag caccacctcc 
23461 catgagagac tgaccactta caaagacact gcacatacag aagccgtgca tccttccaca 

23 521 aacacaggag ggaccaatgt ggcaaccacc agctctggat ataaatcaca gtcctctgtc 
23581 ctagctgact catctccaat gtgtaccacc tccaccatgg gggatacaag tgttctcaca 
2 3 641 tcaactcctg ccttccttga gactaggagg attcagacag agctagcttc ctccctgacc 
23701 cctggattga gggagtccag tggctctgaa gggaccagct caggcaccaa gatgagcact 
23761 gtcctctcta aagtgcccac tggtgctact actgagatct ccaaggaaga cgtcacctcc 

50 2 3 821 atcccaggtc ccgctcaatc cacaatatca ccagacatct ccacaagaac cgtcagctgg 

2 3 881 ttctctacat cccctgtcat gacagaatca gcagaaataa ccatgaacac ccatacaagt 
23941 cctttagggg ccacaacaca aggcaccagt actttggcca cgtcaagcac aacctctttg 
24001 acaatgacac actcaactat atctcaagga ttttcacact cacagatgag cactcttatg 
24061 aggaggggtc ctgaggatgt atcatggatg agccctcccc ttctggaaaa aactagacct 
55 24121 tccttttctc tgatgtcttc accagccaca acttcacctt ctcctgtttc ctccacatta 

24181 ccagagagca tctcttcctc tcctcttcct gtgacttcac tcctcacgtc tggcttggca 
24241 aaaactacag atatgttgca caaaagctca gaacctgtaa ccaactcacc tgcaaatttg 
24301 agcagcacct cagttgaaat actggccacc tctgaagtca ccacagatac agagaaaact 
243 61 catccttctt caaacagaac agtgaccgat gtggggacct ccagttctgg acatgaatcc 
24421 acttcctttg tcctagctga ctcacagaca tccaaagtca catctccaat ggttattacc 
24481 tccaccatgg aggatacgag tgtctccaca tcaactcctg gcttttttga gactagcaga 
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24541 attcagacag aaccaacatc ctccctgacc cttggactga gaaagaccag cagctctgag 
24 601 gggaccagct tagccacaga gatgagcact gtcctttctg gagtgcccac tggtgccact 
24661 gctgaagtct ccaggacaga agtcacctcc tctagcagaa catccatctc aggctttgct 
24721 cagctcacag tgtcaccaga gacttccaca gaaaccatca ccagactccc tacctccagc 
5 2 4781 ataatgacag aatcagcaga aatgatgatc aagacacaaa cagatcctcc tgggtctaca 

24841 ccagagagta ctcatactgt ggacatatca acaacaccca actgggtaga aacccactcg 
24901 actgtgactc agagattttc acactcagag atgaccactc ttgtgagcag aagccctggt 
24961 gatatgttat ggcctagtca atcctctgtg gaagaaacca gctctgcctc ttccctgctg 
25021 tctctgcctg ccacgacctc accttctcct gtttcctcta cattagtaga ggatttccct 

10 25081 tccgcttctc ttcctgtgac ttctcttctc acccctggcc tggtgataac cacagacagg 

2 5141 atgggcataa gcagagaacc tggaaccagt tccacttcaa atttgagcag cacctcccat 
2 52 01 gagagactga ccactttgga agacactgta gatacagaag acatgcagcc ttccacacac 
2 5261 acagcagtga ccaacgtgag gacctccatt tctggacatg aatcacaatc ttctgtccta 
2 5321 tctgactcag agacacccaa agccacatct ccaatgggta ccacctacac catgggggaa 

15 2 5381 acgagtgttt ccatatccac ttctgacttc tttgagacca gcagaattca gatagaacca 

2 5441 acatcctccc tgacttctgg attgagggag accagcagct ctgagaggat cagctcagcc 
2 5501 acagagggaa gcactgtcct ttctgaagtg cccagtggtg ctaccactga ggtctccagg 
25561 acagaagtga tatcctctag gggaacatcc atgtcagggc ctgatcagtt caccatatca 
25621 ccagacatct ctactgaagc gatcaccagg ctttctactt cccccattat gacagaatca 

20 2 5681 gcagaaagtg ccatcactat tgagacaggt tctcctgggg ctacatcaga gggtaccctc 

2 5741 accttggaca cctcaacaac aaccttttgg tcagggaccc actcaactgc atctccagga 
2 5801 ttttcacact cagagatgac cactcttatg agtagaactc ctggagatgt gccatggccg 
25861 agccttccct ctgtggaaga agccagctct gtctcttcct cactgtcttc acctgccatg 
25921 acctcaactt cttttttctc cgcattacca gagagcatct cctcctctcc tcatcctgtg 

25 2 5981 actgcacttc tcacccttgg cccagtgaag accacagaca tgttgcgcac aagctcagaa 

2 6041 cctgaaacca gttcacctcc aaatttgagc agcacctcag ctgaaatatt agccacgtct 
2 6101 gaagtcacca aagatagaga gaaaattcat ccctcctcaa acacacctgt agtcaatgta 
2 6161 gggactgtga tttataaaca tctatcccct tcctctgttt tggctgactt agtgacaaca 
2 6221 aaacccacat ctccaatggc taccacctcc actctgggga atacaagtgt ttccacatca 

30 26281 actcctgcct tcccagaaac tatgatgaca cagccaactt cctccctgac ttctggatta 

2 6341 agggagatca gtacctctca agagaccagc tcagcaacag agagaagtgc ttctctttct 
2 6401 ggaatgccca ctggtgctac tactaaggtc tccagaacag aagccctctc cttaggcaga 
2 6461 acatccaccc caggtcctgc tcaatccaca atatcaccag aaatctccac ggaaaccatc 
2 6521 actagaattt ctactcccct caccacgaca ggatcagcag aaatgaccat cacccccaaa 

35 2 6581 acaggtcatt ctggggcatc ctcacaaggt acctttacct tggacacatc aagcagagcc 

2 6641 tcctggccag gaactcactc agctgcaact cacagatctc cacactcagg gatgaccact 
2 6701 cctatgagca gaggtcctga ggatgtgtca tggccaagcc gcccatcagt ggaaaaaact 
2 6761 agccctccat cttccctggt gtctttatct gcagtaacct caccttcgcc actttattcc 
2 6821 acaccatctg agagtagcca ctcatctcct ctccgggtga cttctctttt cacccctgtc 

40 2 6881 atgatgaaga ccacagacat gttggacaca agcttggaac ctgtgaccac ttcacctccc 

2 6941 agtatgaata tcacctcaga tgagagtctg gccacttcta aagccaccat ggagacagag 
2 7001 gcaattcagc tttcagaaaa cacagctgtg actcagatgg gcaccatcag cgctagacaa 
27061 gaattctatt cctcttatcc aggcctccca gagccatcca aagtgacatc tccagtggtc 
27121 acctcttcca ccataaaaga cattgtttct acaaccatac ctgcttcctc tgagataaca 

45 27181 agaattgaga tggagtcaac atccaccctg acccccacac caagggagac cagcacctcc 

2 7241 caggagatcc actcagccac aaagccaagc actgttcctt acaaggcact cactagtgcc 
2 73 01 acgattgagg actccatgac acaagtcatg tcctctagca gaggacctag ccctgatcag 
2 73 61 tccacaatgt cacaagacat atccagtgaa gtgatcacca ggctctctac ctcccccatc 
2 7421 aaggcagaat ctacagaaat gaccattacc acccaaacag gttctcctgg ggctacatca 

50 27481 aggggtaccc ttaccttgga cacttcaaca acttttatgt cagggaccca ctcaactgca 

27541 tctcaaggat tttcacactc acagatgacc gctcttatga gtagaactcc tggagatgtg 
2 7 601 ccatggctaa gccatccctc tgtggaagaa gccagctctg cctctttctc actgtcttca 
2 7661 cctgtcatga cctcatcttc tcccgtttct tccacattac cagacagcat ccactcttct 
2 7721 tcgcttcctg tgacatcact tctcacctca gggctggtga agaccacaga gctgttgggc 

55 2 7781 acaagctcag aacctgaaac cagttcaccc ccaaatttga gcagcacctc agctgaaata 

2 7841 ctggccacca ctgaagtcac tacagataca gagaaactgg agatgaccaa tgtggtaacc 
27901 tcaggttata cacatgaatc tccttcctct gtcctagctg actcagtgac aacaaaggcc 
27961 acatcttcaa tgggtatcac ctaccccaca ggagatacaa atgttctcac atcaacccct 
2 8 021 gccttctctg acaccagtag gattcaaaca aagtcaaagc tctcactgac tcctgggttg 

60 2 8 081 atggagacca gcatctctga agagaccagc tctgccacag aaaaaagcac tgtcctttct 

2 8141 agtgtgccca ctggtgctac tactgaggtc tccaggacag aagccatctc ttctagcaga 
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acatccatcc 
actagaattt 
acaggtcctt 
tcctggccag 
cctatgagca 
agccctccat 
acaccatctg 
atgatgaagg 
aatatgaata 
gcaattcacg 
gaactctatt 
acctcttcct 
acaaggattg 
tcccaggaca 
gccactcctg 
cctgctcagt 
tctcccatca 
gctacatccc 
tcaactatgt 
gaaagtctgt 
acatcattac 
ccctcctctc 
gtgttggata 
gttgaaatac 
aacacagcgg 
ctagctgact 
gataccactg 
ccaacattct 
atcacagaaa 
atgacagaaa 
tctccagaca 
tcaacacaaa 
cttaccttgg 
tttttacact 
agctctccct 
acctcacctt 
acttcactcc 
agtttatctc 
acagatacag 
agttctggac 
tacccagtgg 
aattttgaga 
aagacaaaca 
gggtccactc 
tccccagact 
tttaatgctt 
tttactatgt 
gagactattt 
accactggag 
ggccctgggg 
ccattctcct 
ataacttcct 
actactgaag 
acatcttcaa 
acaagtgctt 
gaacacatca 
ggccctcaga 
agaatggaga 
agagccacct 
gcatcaaggc 
ttccctgatg 



caggccctgc 
ctacccccct 

ctggggctac 

gaactcactc 
gaggtcctga 
cttccctggt 
ggagtagcca 
ccacagacat 
tcacctcaga 
tttttgaaaa 
cctcttcccc 
ctataagaga 
agatagagtc 
tcacctcatc 
aggtctccag 
ccacaatgtc 
tgacagaatc 
aggttaccct 
ctcaaggact 
catggacgag 
ctctcacgac 
ctcttcctgt 
caagctcaga 
cggccacctc 
tggccaaagt 
cagaaacaac 
ttttcacatc 
cattgactcc 
caagtgcagt 
tcatgtcctc 
tcatcactga 
tgaccatcac 
ccacaacaac 
cagagatgac 
ttgtggaaaa 
ctgtttcttc 
tcaccccagg 
caaatctgag 
agaaaattca 
atgaactata 
gtactccctc 
ccacaggatt 
tgtccctgga 
accttttaca 
ggcctccagc 
ctccatctat 
ctgtaacaga 
ccactggcac 
ttccacgagc 
atgctactct 
cttcaacctt 
cttcagctac 
gacgcttggt 
cacccatttt 
atcaagttcc 
caaaaatacc 
catccacttc 
ccaccaccac 
tgaccaccag 
aaatggccag 
ttccagaaac 



tcaatccaca 
cacaaggaaa 
ctcgcagggt 
agctacaact 
ggatgtgtca 
atcttcatct 
ctcctctcct 
gttggatgca 
tgagagtctg 
tacagcagcg 
aggcttctca 
caacatggtt 
aatgtcatct 
cacagagaca 
gacagaagtt 
actagacatc 
tgcagaaata 
tcccttgggc 
ttcacactca 
ccctcgcttt 
ctcactttct 
gacttcactt 
gcctaaaacc 
tgaaatcatg 
gaggacctcc 
cataaccata 
aaatcctgcc 
tggattcagg 
cctttatgga 
taatagaaca 
agtgatcacc 
cacccaaaaa 
agcccccttg 
aactcttatg 
aactagctct 
cacattaccg 
catggtgaag 
tggcacctca 
tccttcttca 
ttcctctgtt 
ttccatggct 
tgaggctgag 
caccagctca 
gagttccaag 
ctcacagtat 
tacggagtcc 
aagtactcat 
agtgatgcct 
catctcaggt 
gtccaccatt 
cactaccact 
cccatataga 
tatggtcagt 
ggataccaga 
ttcactctca 
caatgaagca 
gcctgccagt 
agctttgaag 
tgtctatact 
cacaatcctc 
gacatcctca 



atgtcatcag 
gaatcaacag 
acctttacct 
cagagatttc 
tggccaagcc 
tcagtaacct 
gtccctgtca 
agtttggaac 
gccacttcta 
tcccatgtgg 
gagccaacaa 
tccacaacaa 
ctgacccctg 
agcactgtcc 
atgccctcta 
tccgatgaag 
accatcacca 
acctcaatga 
gagatgacca 
gtggaaacaa 
cctgtgtcct 
atcctcccag 
agttcatctc 
acagatacag 
agttctgttc 
ccttcaatgg 
ttctctgaga 
gagactagca 
gtgcccacta 
cacatccctg 
aggctctctt 
agttctcctg 
gcaaggaccc 
agtaggagtc 
tcatcttctc 
cagagtatcc 
actacagaca 
gttgaaatac 
agcatggcag 
tcaatccact 
gaaacctcta 
ccattttctc 
gtcacaccaa 
actgatttca 
actgaaattc 
actgggataa 
catctgagta 
tctctatcag 
tcaggtagtc 
gcagagagcc 
gattcttcaa 
gtggacacca 
actttggaca 
atgacagaga 
acacggttga 
gcacacagag 
cctaaaggac 
accaccacca 
cccactttgg 
acagaaatga 
ttggctacca 



acacctccat 
acatggccat 
tggactcatc 
cacagtcagt 
cgctgtctgt 
caccttcgcc 
cttctctttt 
ctgagaccac 
aagccaccac 
aaaccaccag 
aagtgatatc 
tgcctggctc 
gactgaggga 
tttacaagat 
gcagaacatc 
ttgtcaccag 
cccaaacagg 
cctttttgtc 
atcttatgag 
ctagatcttc 
ccacattact 
gcctggtgaa 
caaatttgag 
agaaaattca 
atgaatctca 
gtatcacctc 
ctaggaggat 
cctctgaaga 
gtgctactac 
actctgatca 
cctcatccat 
gggctacagc 
actcaactgt 
ctgaaaatcc 
tgttgtcctt 
cttcctcctc 
caagcacaga 
tggctgcctc 
tgaccaatgt 
cggagccatc 
tttccacatc 
atttgacttc 
caaatacacc 
cctcttctgc 
cagtggacat 
cctccttccc 
cagatttgct 
aggccatgac 
cattctctag 
tgccttcatc 
ccatcccagc 
gtcttgggac 
cttcaagcca 
gcgttgagct 
caagaactga 
gtaccataag 
tacacacagg 
cagctttgaa 
gaacactgac 
tgatcacaac 
gcctgggagc 



ggaaaccatc 
cacccccaaa 
aagcacagcc 
ggtgacaact 
ggaaaaaaac 
actttattcc 
cacctctatc 
ttcagctccc 
ggagacagag 
tgctacagag 
tccagtggtc 
ctctggcatt 
gaccagaacc 
gtcctctggt 
cattcctggc 
gctgtctacc 
ttattctctg 
agggacccac 
caggggtcct 
ctcttctctg 
agacagtagc 
gactacagaa 
cagcacctca 
tccttcctca 
ttcctctgtc 
cgctgtggac 
tccgacagag 
gaccacctca 
tgaagtctcc 
gtccacgatg 
gatgtcagaa 
acagagtact 
tcctcctaga 
atcatggaag 
acctgtcacg 
tttttctgtg 
acctggaacc 
tgaagtcacc 
gggaaccacc 
caaggctaca 
aatgcctgct 
tggatttagg 
ttcttctcct 
aaaaacatca 
aatcaccccc 
agaatccagg 
gccttcagct 
ttcatttgcc 
gacagagtca 
cactcctgtg 
cctccatgag 
agagagcagc 
accaggcagg 
gggaacagtg 
tggcattatg 
accagtcaaa 
agggacaaaa 
gaccacttcc 
tcccctcaat 
cccatatgtt 
agaaaccagc 
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31861 acagctcttc ccaggacaac cccatctgtt ctcaatagag aatcagagac cacagcctca 
31921 ctggtctctc gttctggggc agagagaagt ccggttattc aaactctaga tgtttcttct 
31981 agtgagccag atacaacagc ttcatgggtt atccatcctg cagagaccat cccaactgtt 
32041 tccaagacaa cccccaattt tttccacagt gaattagaca ctgtatcttc cacagccacc 
5 3 2101 agtcatgggg cagacgtcag ctcagccatt ccaacaaata tctcacctag tgaactagat 

32161 gcactgaccc cactggtcac tatttcgggg acagatacta gtacaacatt cccaacactg 
3 2221 actaagtccc cacatgaaac agagacaaga accacatggc tcactcatcc tgcagagacc 
32281 agctcaacta ttcccagaac aatccccaat ttttctcatc atgaatcaga tgccacacct 
32341 tcaatagcca ccagtcctgg ggcagaaacc agttcagcta ttccaattat gactgtctca 
10 3 24 01 cctggtgcag aagatctggt gacctcacag gtcactagtt ctgggacaga cagaaatatg 

3 2461 actattccaa ctttgactct ttctcctggt gaaccaaaga cgatagcctc attagtcacc 
32521 catcctgaag cacagacaag ttcggccatt ccaacttcaa ctatctcgcc tgctgtatca 
3 2 581 cggttggtga cctcaatggt caccagtttg gcggcaaaga caagtacaac taatcgagct 
3 2 641 ctgacaaact cccctggtga accagctaca acagtttcat tggtcacgca tcctgcacag 
32701 accagcccaa cagttccctg gacaacttcc atttttttcc atagtaaatc agacaccaca 
3 2 761 ccttcaatga ccaccagtca tggggcagaa tccagttcag ctgttccaac tccaactgtt 
3 2 821 tcaactgagg taccaggagt agtgacccct ttggtcacca gttctagggc agtgatcagt 
32881 acaactattc caattctgac tctttctcct ggtgaaccag agaccacacc ttcaatggcc 
3 2 941 accagtcatg gggaagaagc cagttctgct attccaactc caactgtttc acctggggta 
3 3 001 ccaggagtgg tgacctctct ggtcactagt tctagggcag tgactagtac aactattcca 

33 061 attctgactt tttctcttgg tgaaccagag accacacctt caatggccac cagtcatggg 
3 3121 acagaagctg gctcagctgt tccaactgtt ttacctgagg taccaggaat ggtgacctct 
3 3181 ctggttgcta gttctagggc agtaaccagt acaactcttc caactctgac tctttctcct 
3 3 241 ggtgaaccag agaccacacc ttcaatggcc accagtcatg gggcagaagc cagctcaact 

25 333 01 gttccaactg tttcacctga ggtaccagga gtggtgacct ctctggtcac tagttctagt 

33361 ggagtaaaca gtacaagtat tccaactctg attctttctc ctggtgaact agaaaccaca 
3 3421 ccttcaatgg ccaccagtca tggggcagaa gccagctcag ctgttccaac tccaactgtt 
3 3481 tcacctgggg tatcaggagt ggtgacccct ctggtcacta gttccagggc agtgaccagt 
33541 acaactattc caattctaac tctttcttct agtgagccag agaccacacc ttcaatggcc 

30 3 3 601 accagtcatg gggtagaagc cagctcagct gttctaactg tttcacctga ggtaccagga 

3 3 661 atggtgacct ctctggtcac tagttctaga gcagtaacca gtacaactat tccaactctg 
3 3721 actatttctt ctgatgaacc agagaccaca acttcattgg tcacccattc tgaggcaaag 
33781 atgatttcag ccattccaac tttagctgtc tcccctactg tacaagggct ggtgacttca 
3 3 841 ctggtcacta gttctgggtc agagaccagt gcgttttcaa atctaactgt tgcctcaagt 

35 3 3 901 caaccagaga ccatagactc atgggtcgct catcctggga cagaagcaag ttctgttgtt 

33961 ccaactttga ctgtctccac tggtgagccg tttacaaata tctcattggt cacccatcct 

34 021 gcagagagta gctcaactct tcccaggaca acctcaaggt tttcccacag tgaattagac 
34081 actatgcctt ctacagtcac cagtcctgag gcagaatcca gctcagccat ttcaactact 
34141 atttcacctg gtataccagg tgtgctgaca tcactggtca ctagctctgg gagagacatc 
342 01 agtgcaactt ttccaacagt gcctgagtcc ccacatgaat cagaggcaac agcctcatgg 

342 61 gttactcatc ctgcagtcac cagcacaaca gttcccagga caacccctaa ttattctcat 
34321 agtgaaccag acaccacacc atcaatagcc accagtcctg gggcagaagc cacttcagat 

343 81 tttccaacaa taactgtctc acctgatgta ccagatatgg taacctcaca ggtcactagt 
34441 tctgggacag acaccagtat aactattcca actctgactc tttcttctgg tgagccagag 
34501 accacaacct catttatcac ctattctgag acacacacaa gttcagccat tccaactctc 
34561 cctgtctccc ctggtgcatc aaagatgctg acctcactgg tcatcagttc tgggacagac 
3 4621 agcactacaa ctttcccaac actgacggag accccatatg aaccagagac aacagccata 
34681 cagctcattc atcctgcaga gaccaacaca atggttccca agacaactcc caagttttcc 
34741 catagtaagt cagacaccac actcccagta gccatcacca gtcctgggcc agaagccagt 
348 01 tcagctgttt caacgacaac tatctcacct gatatgtcag atctggtgac ctcactggtc 
3 48 61 cctagttctg ggacagacac cagtacaacc ttcccaacat tgagtgagac cccatatgaa 
34921 ccagagacta cagtcacgtg gctcactcat cctgcagaaa ccagcacaac ggtttctggg 
34981 acaattccca acttttccca taggggatca gacactgcac cctcaatggt caccagtcct 
3 5041 ggagtagaca cgaggtcagg tgttccaact acaaccatcc cacccagtat accaggggta 
3 5101 gtgacctcac aggtcactag ttctgcaaca gacactagta cagctattcc aactttgact 
3 5161 ccttctcctg gtgaaccaga gaccacagcc tcatcagcta cccatcctgg gacacagact 
3 5221 ggcttcactg ttccaattcg gactgttccc tctagtgagc cagatacaat ggcttcctgg 
3 5281 gtcactcatc ctccacagac cagcacacct gtttccagaa caacctccag tttttcccat 
3 5341 agtagtccag atgccacacc tgtaatggcc accagtccta ggacagaagc cagttcagct 
3 5401 gtactgacaa caatctcacc tggtgcacca gagatggtga cttcacagat cactagttct 
3 5461 ggggcagcaa ccagtacaac tgttccaact ttgactcatt ctcctggtat gccagagacc 
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3 5521 acagccttat tgagcaccca tcccagaaca gggacaagta aaacatttcc tgcttcaact 
3 5581 gtgtttcctc aagtatcaga gaccacagcc tcactcacca ttagacctgg tgcagagact 
35641 agcacagctc tcccaactca gacaacatcc tctctcttca ccctacttgt aactggaacc 
3 5701 agcagagttg atctaagtcc aactgcttca cctggtgttt ctgcaaaaac agccccactt 
5 35761 tccacccatc cagggacaga gaccagcaca atgattccaa cttcaactct ttcccttggt 

3 5821 ttactagaga ctacaggctt actggccacc agctcttcag cagagaccag cacgagtact 
35881 ctaactctga ctgtttcccc tgctgtctct gggctttcca gtgcctctat aacaactgat 
35941 aagccccaaa ctgtgacctc ctggaacaca gaaacctcac catctgtaac ttcagttgga 
3 6001 cccccagaat tttccaggac tgtcacaggc accactatga ccttgatacc atcagagatg 

10 3 6061 ccaacaccac ctaaaaccag tcatggagaa ggagtgagtc caaccactat cttgagaact 

3 6121 acaatggttg aagccactaa tttagctacc acaggttcca gtcccactgt ggccaagaca 
3 6181 acaaccacct tcaatacact ggctggaagc ctctttactc ctctgaccac acctgggatg 
3 6241 tccaccttgg cctctgagag tgtgacctca agaacaagtt ataaccatcg gtcctggatc 
3 63 01 tccaccacca gcagttataa ccgtcggtac tggacccctg ccaccagcac tccagtgact 

15 36361 tctacattct ccccagggat ttccacatcc tccatcccca gctccacagc agccacagtc 

3 6421 ccattcatgg tgccattcac cctcaacttc accatcacca acctgcagta cgaggaggac 
3 6481 atgcggcacc ctggttccag gaagttcaac gccacagaga gagaactgca gggtctgctc 
3 6541 aaacccttgt tcaggaatag cagtctggaa tacctctatt caggctgcag actagcctca 
3 6601 ctcaggccag agaaggatag ctcagccatg gcagtggatg ccatctgcac acatcgccct 

20 3 6661 gaccctgaag acctcggact ggacagagag cgactgtact gggagctgag caatctgaca 

3 6721 aatggcatcc aggagctggg cccctacacc ctggaccgga acagtctcta tgtcaatggt 
3 6781 ttcacccatc gaagctctat gcccaccacc agcactcctg ggacctccac agtggatgtg 
3 6841 ggaacctcag ggactccatc ctccagcccc agccccacgg ctgctggccc tctcctgatg 
3 6901 ccgttcaccc tcaacttcac catcaccaac ctgcagtacg aggaggacat gcgtcgcact 

25 3 6961 ggctccagga agttcaacac catggagagt gtcctgcagg gtctgctcaa gcccttgttc 

3 7 021 aagaacacca gtgttggccc tctgtactct ggctgcagat tgaccttgct caggcccgag 
3 7 081 aaagatgggg cagccactgg agtggatgcc atctgcaccc accgccttga ccccaaaagc 
3 7141 cctggactca acagggagca gctgtactgg gagctaagca aactgaccaa tgacattgaa 
3 72 01 gagctgggcc cctacaccct ggacaggaac agtctctatg tcaatggttt cacccatcag 

30 3 72 61 agctctgtgt ccaccaccag cactcctggg acctccacag tggatctcag aacctcaggg 

3 7321 actccatcct ccctctccag ccccacaatt atggctgctg gccctctcct ggtaccattc 
3 7381 accctcaact tcaccatcac caacctgcag tatggggagg acatgggtca ccctggctcc 
3 7441 aggaagttca acaccacaga gagggtcctg cagggtctgc ttggtcccat attcaagaac 
3 75 01 accagtgttg gccctctgta ctctggctgc agactgacct ctctcaggtc tgagaaggat 

35 3 7561 ggagcagcca ctggagtgga tgccatctgc atccatcatc ttgaccccaa aagccctgga 

3 7621 ctcaacagag agcggctgta ctgggagctg agccaactga ccaatggcat caaagagctg 
3 7681 ggcccctaca ccctggacag gaacagtctc tatgtcaatg gtttcaccca tcggacctct 
3 7741 gtgcccacca ccagcactcc tgggacctcc acagtggacc ttggaacctc agggactcca 
3 78 01 ttctccctcc caagccccgc aactgctggc cctctcctgg tgctgttcac cctcaacttc 

40 3 7 861 accatcacca acctgaagta tgaggaggac atgcatcgcc ctggctccag gaagttcaac 

3 7921 accactgaga gggtcctgca gactctgctt ggtcctatgt tcaagaacac cagtgttggc 
3 7981 cttctgtact ctggctgcag actgaccttg ctcaggtccg agaaggatgg agcagccact 
3 8 041 ggagtggatg ccatctgcac ccaccgtctt gaccccaaaa gccctggact ggacagagag 
3 8101 cagctatact gggagctgag ccagctgacc aatggcatca aagagctggg cccctacacc 

45 3 8161 ctggacagga acagtctcta tgtcaatggt ttcacccatt ggatccctgt gcccaccagc 

3 8221 agcactcctg ggacctccac agtggacctt gggtcaggga ctccatcctc cctccccagc 
3 8281 cccacagctg ctggccctct cctggtgcca ttcaccctca acttcaccat caccaacctg 
3 8341 cagtacgagg aggacatgca tcacccaggc tccaggaagt tcaacaccac ggagcgggtc 
3 84 01 ctgcagggtc tgcttggtcc catgttcaag aacaccagtg tcggccttct gtactctggc 

50 3 8461 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 

3 8521 tgcacccacc gtcttgaccc caaaagccct ggagtggaca gggagcagct atactgggag 
3 8581 ctgagccagc tgaccaatgg catcaaagag ctgggtccct acaccctgga cagaaacagt 
3 8641 ctctatgtca atggtttcac ccatcagacc tctgcgccca acaccagcac tcctgggacc 
3 8701 tccacagtgg accttgggac ctcagggact ccatcctccc tccccagccc tacatcngct 

55 3 8761 ggccctctcc tggtnccntt caccctcaac ttcaccatca ccaacctgca gtacgaggag 

3 8821 gacatgcggc acccnggntc caggaagttc aacaccacng agagggtnct gcagggtctg 
3 8881 ctnaagcccc tnttcaagag caccagtgtt ggccctctgt actctggctg cagactgacc 
3 8941 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 
390 01 cttgacccca aaagccctgg agtggacagg gagcagctat actgggagct gagccagctg 

60 3 9061 accaatggca tcaaagagct gggtccctac accctggaca gaaacagtct ctatgtcaat 

3 9121 ggtttcaccc atcagacctc tgcgcccaac accagcactc ctgggacctc cacagtggac 



WO 2004/075713 



18/47 



PCT/CA2004/000281 



3 9181 cttgggacct cagggactcc atcctccctc cccagcccta catctgctgg ccctctcctg 
3 9241 gtgccattca ccctcaactt caccatcacc aacctgcagt acgaggagga catgcatcac 
39301 ccaggctcca ggaagttcaa caccacggag cgggtcctgc agggtctgct tggtcccatg 
3 93 61 ttcaagaaca ccagtgtcgg ccttctgtac tctggctgca gactgacctt gctcaggcct 
5 3 9421 gagaagaatg gggcagccac tggaatggat gccatctgca gccaccgtct tgaccccaaa 

39481 agccctggac tcaacagaga gcagctgtac tgggagctga gccagctgac ccatggcatc 
3 9541 aaagagctgg gcccctacac cctggacagg aacagtctct atgtcaatgg tttcacccat 
3 9601 cggagctctg tggcccccac cagcactcct gggacctcca cagtggacct tgggacctca 
39661 gggactccat cctccctccc cagccccaca acagctgttc ctctcctggt gccgttcacc 
10 3 9721 ctcaacttta ccatcaccaa tctgcagtat ggggaggaca tgcgtcaccc tggctccagg 

39781 aagttcaaca ccacagagag ggtcctgcag ggtctgcttg gtcccttgtt caagaactcc 

3 9 841 agtgtcggcc ctctgtactc tggctgcaga ctgatctctc tcaggtctga gaaggatggg 
39901 gcagccactg gagtggatgc catctgcacc caccacctta accctcaaag ccctggactg 
39961 gacagggagc agctgtactg gcagctgagc cagatgacca atggcatcaa agagctgggc 

15 40021 ccctacaccc tggaccggaa cagtctctac gtcaatggtt tcacccatcg gagctctggg 

40081 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 

4 0141 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct caacttcacc 
4 0201 atcaccaacc tgcagtatga ggaggacatg catcgccctg gatctaggaa gttcaacacc 
4 0261 acagagaggg tcctgcaggg tctgcttagt cccattttca agaactccag tgttggccct 

20 40321 ctgtactctg gctgcagact gacctctctc aggcccgaga aggatggggc agcaactgga 

4 0381 atggatgctg tctgcctcta ccaccctaat cccaaaagac ctggactgga cagagagcag 
4 0441 ctgtactggg agctaagcca gctgacccac aacatcactg agctgggccc ctacagcctg 
40501 gacagggaca gtctctatgt caatggtttc acccatcaga actctgtgcc caccaccagt 
4 0561 actcctggga cctccacagt gtactgggca accactggga ctccatcctc cttccccggc 
25 4 0621 cacacagagc ctggccctct cctgatacca ttcactttca actttaccat caccaacctg 

40681 cattatgagg aaaacatgca acaccctggt tccaggaagt tcaacaccac ggagagggtt 
40741 ctgcagggtc tgctcaagcc cttgttcaag aacaccagtg ttggccctct gtactctggc 
4 08 01 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
4 08 61 tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgggag 
30 4 0921 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 

4 0981 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 
41041 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 
41101 ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 
41161 aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 
35 41221 ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc 

412 81 ttgctcagac ctgagaagca tgaggcagcc actggagtgg acaccatctg tacccaccgc 
41341 gttgatccca tcggacctgg actggacagg gagcggctat actgggagct gagccagctg 
414 01 accaacagca ttaccgaact gggaccctac accctggaca gggacagtct ctatgtcaat 
41461 ggcttcaacc ctcggagctc tgtgccaacc accagcactc ctgggacctc cacagtgcac 
40 41521 ctggcaacct ctgggactcc atcctccctg cctggccaca cagcccctgt ccctctcttg 

41581 ataccattca ccctcaactt taccatcacc aacctgcatt atgaggaaaa catgcaacac 
41641 cctggttcca ggaagttcaa caccacggag agggttctgc agggtctgct caagcccttg 
41701 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 
41761 gagaagcatg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 
45 41821 ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caaimncatc 

41881 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
41941 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 
42001 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 
42061 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 
50 42121 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 

42181 agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 
42241 gcagccactg gaatggatgc catctgcagc caccgtcttg accccaaaag ccctggactc 
42301 gacagagagc agctgtactg ggagctgagc cagctgaccc atggcatcaa agagctgggc 
42361 ccctacaccc tggacaggaa cagtctctat gtcaatggtt tcacccatcg gagctctgtg 
42421 gcccccacca gcactcctgg gacctccaca gtggaccttg ggacctcagg gactccatcc 
42481 tccctcccca gccccacaac agctgttcct ctcctggtgc cgttcaccct caactttacc 
42541 atcaccaatc tgcagtatgg ggaggacatg cgtcaccctg gctccaggaa gttcaacacc 
42 601 acagagaggg tcctgcaggg tctgcttggt cccttgttca agaactccag tgtcggccct 
42661 ctgtactctg gctgcagact gatctctctc aggtctgaga aggatggggc agccactgga 
42 721 gtggatgcca tctgcaccca ccaccttaac cctcaaagcc ctggactgga cagggagcag 
42781 ctgtactggc agctgagcca gatgaccaat ggcatcaaag agctgggccc ctacaccctg 
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gaccggaaca 

actccttgga 

cccacaactg 

cagtatgagg 

ctgcagggtc 

tgcagactga 

tgcctctacc 

ctaagccagc 

ctctatgtca 

tccacaatgc 

agccctctcc 

gacatgcgtc 

ctcaagccct 

ttgctcaggc 

cttgacccca 

accaatgaca 

ggtttcaccc 

ctcagaacct 

ctcctgntnc 

cnncncccng 

cccttgttca 

aggcctgaga 

cccaaaagcc 

agcatcactg 

aaccctcgga 

acctctggga 

ttcaccntca 

tccaggaagt 

aatagcagtc 

gatagctcag 

ggactggaca 

ctgggcccct 

tctgggctca 

ccatcccccg 

ttcaccatca 

aacaccacgg 

ggccctctgt 

actggagtgg 

gagcggctat 

accctggata 

accagcactc 

cctggccaca 

gacctgcatt 

agggttctgc 

tctggctgca 

gccatctgca 

tgggagctga 

gacagtctct 

gggacctctg 

gcccctggcc 

gaggaggaca 

ggtctgctca 

ctgaccttgc 

catcgtcctg 

cagctgaccc 

gtcaatggtt 

atgcacctgg 

ctcctggtgc 

catcaccctg 

cctgtgttca 

aggcccaaga 



gtctctacgt 
cttccacagt 
ctggccctct 
aggacatgca 
tgcttagtcc 
cctctctcag 
accctaatcc 
tgacccacaa 
atggtttcac 
acctggcaac 

tggtgctatt 

gcactggctc 
tgttcaagaa 
ccaagaaaga 
aaagccctgg 
ttgaagagct 
atcagagctc 
cagggactcc 
cnttcaccnt 
gntccaggaa 
agaacaccag 
aggatggggc 
ctggactgga 
agctgggacc 
gctctgtgcc 
ctccatcctc 
acttnaccat 
tcaacaccac 
tggaatacct 
ccatggcagt 
gagagcgact 
acaccctgga 
ccaccagcac 
tccccagccc 
ccaacctgca 
agagggttct 
actctggctg 
acaccatctg 
actgggagct 
gggacagtct 
ctgggacctc 
cagcccctgt 
atgaagaaaa 
agggtctgct 
gactgacctt 
ccctccgcct 
gccagctgac 
atgtcaatgg 
cagtgcacct 
ctctcctggt 
tgcgtcaccc 
agcccttgtt 
tcaggcctga 
accccaaaag 
acggcatcac 
tcacccatca 
caacctcgag 
tattcacaat 
gctctagaaa 
agaacaccag 
aggatggggc 



caatggtttc 

tgaccttgga 

cctggtgcca 

tcgccctgga 

catattcaag 

gcccgagaag 

caaaagacct 

catcactgag 

ccatcagagc 

ctcgagaact 

cacaatcaac 

caggaagttc 

caccagtgtt 

tggggcagcc 

actcaacagg 

gggcccctac 

tgtgtccacc 

atcctccctc 

caacttnacc 

gttcaacacc 

tgtcagctct 

agccaccaga 

cagagagcaa 

ctacaccctg 

aaccaccagc 

cctgcctggc 

caccaacctg 

ngagagggtt 

ctattcaggc 

ggatgccatc 

gtactgggag 

ccggaacagt 

tccttggact 

cacaactgct 

gtatgaggag 

gcagggtctg 

cagactgacc 

tacccaccgc 

gagccagctg 

ctatgtcaat 

cacagtgcac 

ccctctcttg 

catgcaacac 

caagcccttg 

gctcagacct 

tgatcccact 

caacagcgtt 

cttcacccat 

ggaaacctct 

gccattcacc 

tggttccagg 

caagaacacc 

gaaggatggg 

ccctggactg 

tgagctgggc 

gagctctatg 

aactccagcc 

taacttcacc 

gtttaacacc 

tgttggccct 

agccaccaaa 



acccatcgga 
acctcaggga 
ttcaccctaa 
tctaggaagt 
aactccagtg 
gatggggcag 
ggactggaca 
ctgggcccct 
tctatgacga 
ccagcctccc 
tgcaccatca 
aacaccatgg 
ggccctctgt 
actggagtgg 
gagcagctgt 
accctggaca 
accagcactc 
tccagcccca 
atcaccaacc 
acngagaggg 
ctgtactctg 
gtggatgctg 
ctatactggg 
gacagggtca 
actcctggga 
cacacancnn 
cantangngg 
ctgcagggtc 
tgcagactag 
tgcacacatc 
ctgagcaatc 
ctctacgtca 
tccacagttg 
ggccctctcc 
gacatgcatc 
ctcacgccct 
ttgctcagac 
gttgatccca 
accaacagca 
ggcttcaacc 
ctggcaacct 
ataccattca 
cctggttcca 
ttcaagagca 
gagaaacatg 
ggtcctggac 
acagagctgg 
cggagctctg 
gggactccag 
ctcaacttca 
aagttcagca 
agtgtcagct 
gcagccacca 
gacagagagc 
ccctacaccc 
acgaccacca 
tccctgtctg 
atcactaacc 
acggagagag 
ctgtactctg 
gtggatgcca 



gctctgggct 
ctccatcccc 
acttcaccat 
tcaacgccac 
ttggccctct 
caactggaat 
gagagcagct 
acagcctgga 
ccaccagaac 
tgtctggacc 
ccaacctgca 
agagtgtcct 
actctggctg 
atgccatctg 
actgggagct 
ggaacagtct 

ctgggacctc 

caattatgnc 
tgcantangn 
tcctacaggg 
gttgcagact 
cctgcaccta 
agctgagcca 
gtctctatgt 
cctccacagt 
ctgnccctct 
annacatgcn 
tgctcaaacc 
cctcactcag 
gccctgaccc 
tgacaaatgg 
atggtttcac 
accttggaac 
tggtgccatt 
gccctggttc 
tgttcaagaa 
ctgagaagca 
tcggacctgg 
tcacagagct 
cttggagctc 
ctgggactcc 
ccctcaactt 
ggaagttcaa 
ccagcgttgg 
gggcagccac 
tggacagaga 
gcccctacac 
tgccaaccac 
cctccctccc 
ctatcaccaa 
ccacggagag 
ctctgtactc 
gagtggatgc 
ggctgtactg 
tggacaggca 
gaactcctga 
gacctacgac 
tgcggtatga 
tccttcaggg 
gctgcagact 
tctgcaccta 



caccaccagc 

cgtccccagc 

caccaacctg 

agagagggtc 

gtactctggc 

ggatgctgtc 

gtactgggag 

cagggacagt 

tcctgatacc 

tacgaccgcc 

gtacgaggag 

gcagggtctg 

cagattgacc 

cacccaccgc 

aagcaaactg 

ctatgtcaat 

cacagtggat 

imctgnccct 

ggannacatg 

tctgctcagg 

gaccttgctc 

ccgccctgat 

gctaacccac 

caatggcttc 

gcacctggca 

cctgntnccn 

ncncccnggn 

cttgttcagg 

gccagagaag 

tgaagacctc 

catccaggag 

ccatcggagc 

ctcagggact 

caccctcaac 

caggaggttc 

caccagtgtt 

agaggcagcc 

actggacaga 

gggaccctac 

tgtgccaacc 

atcctccctg 

taccatcacc 

caccacggag 

ccctctgtac 

tggagtggac 

gcggctatac 

cctggacagg 

cagtattcct 

tggccacaca 

cctgcagtat 

agtcctgcag 

tggttgcaga 

tgtctgcacc 

gaagctgagc 

cagtctctat 

tacctccaca 

cgccagccct 

ggagaacatg 

tctgctcagg 

gaccacgctc 

ccgccctgat 
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4 6501 cccaaaagcc ctggactgga cagagagcag ctatactggg agctgagcca gctaacccac 

46561 agcatcactg agctgggccc ctacacccag gacagggaca gtctctatgt caatggcttc 

4 6621 acccatcgga gctctgtgcc aaccaccagt attcctggga cctctgcagt gcacctggaa 

4 6681 acctctggga ctccagcctc cctccctggc cacacagccc ctggccctct cctggtgcca 

5 46741 ttcaccctca acttcactat caccaacctg cagtatgagg aggacatgcg tcaccctggt 

46801 tccaggaagt tcaacaccac ggagagagtc ctgcagggtc tgctcaagcc cttgttcaag 

4 6861 agcaccagtg ttggccctct gtactctggc tgcagactga ccttgctcag gcctgaaaaa 

4 6921 cgtggggcag ccaccggcgt ggacaccatc tgcactcacc gccttgaccc tctaaaccca 

46981 ggactggaca gagagcagct atactgggag ctgagcaaac tgacccgtgg catcatcgag 

10 47041 ctgggcccct acctcctgga cagaggcagt ctctatgtca atggtttcac ccatcggacc 

4 7101 tctgtgccca ccaccagcac tcctgggacc tccacagtgg accttggaac ctcagggact 

4 7161 ccattctccc tcccaagccc cgcancnnct gnccctctcc tgntnccntt caccntcaac 

4 7221 ttnaccatca ccaacctgca ntangnggan nacatgcnnc ncccnggntc caggaagttc 

47281 aacaccacng agagggtcct gcagactctg cttggtccta tgttcaagaa caccagtgtt 

15 47341 ggccttctgt actctggctg cagactgacc ttgctcaggt ccgagaagga tggagcagcc 

4 7401 actggagtgg atgccatctg cacccaccgt cttgacccca aaagccctgg agtggacagg 

4 7461 gagcaactat actgggagct gagccagctg accaatggca ttaaagaact gggcccctac 

47521 accctggaca ggaacagtct ctatgtcaat gggttcaccc attggatccc tgtgcccacc 

47581 agcagcactc ctgggacctc cacagtggac cttgggtcag ggactccatc ctccctcccc 

20 47641 agccccacaa ctgctggccc tctcctggtg ccgttcaccc tcaacttcac catcaccaac 

4 7701 ctgaagtacg aggaggacat gcattgccct ggctccagga agttcaacac cacagagaga 

4 7761 gtcctgcaga gtctgcttgg tcccatgttc aagaacacca gtgttggccc tctgtactct 

4 7 821 ggctgcagac tgaccttgct caggtccgag aaggatggag cagccactgg agtggatgcc 

4 7881 atctgcaccc accgtcttga ccccaaaagc cctggagtgg acagggagca gctatactgg 

25 4 7941 gagctgagcc agctgaccaa tggcatcaaa gagctgggtc cctacaccct ggacagaaac 

4 8 001 agtctctatg tcaatggttt cacccatcag acctctgcgc ccaacaccag cactcctggg 

4 8 061 acctccacag tggaccttgg gacctcaggg actccatcct ccctccccag ccctacancn 

4 8121 nctgnccctc tcctgntncc nttcaccntc aacttnacca tcaccaacct gcantangng 

4 8181 gannacatgc nncncccngg ntccaggaag ttcaacacca cngagngngt nctgcagggt 

30 4 8241 ctgctnnnnc ccntnttcaa gaacnccagt gtnggccntc tgtactctgg ctgcagactg 

4 83 01 acctnnctca ggncngagaa gnatggngca gccactggan tggatgccat ctgcanccac 

4 83 61 cnncntnanc ccaaaagncc tggactgnac agngagcngc tntactggga gctnagccan 

4 8421 ctgaccaann ncatcnnnga gctgggnccc tacaccctgg acaggnacag tctctatgtc 

4 8481 aatggtttca cccattggat ccctgtgccc accagcagca ctcctgggac ctccacagtg 

35 4 8541 gaccttgggt cagggactcc atcctccctc cccagcccca caactgctgg ccctctcctg 

4 8 601 gtgccgttca ccctcaactt caccatcacc aacctgaagt acgaggagga catgcattgc 

4 8 661 cctggctcca ggaagttcaa caccacagag agagtcctgc agagtctgct tggtcccatg 

4 8721 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctc gctcaggtcc 

4 8781 gagaaggatg gagcagccac tggagtggat gccatctgca cccaccgtgt tgaccccaaa 

40 4 8 841 agccctggag tggacaggga gcagctatac tgggagctga gccagctgac caatggcatc 

48901 aaagagctgg gtccctacac cctggacaga aacagtctct atgtcaatgg tttcacccat 

4 8961 cagacctctg cgcccaacac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

49021 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 

4 9081 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 

45 49141 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 

492 01 agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 

49261 gcaaccactg gaatggatgc catctgcacc caccgtcttg accccaaaag ccctggactg 

4 9321 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 

4 9381 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 

50 49441 cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc 

4 9501 tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 

49561 atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc 

49621 acngagaggg ttctgcaggg tctgctcaaa cccttgttca ggaatagcag tctggaatac 

49681 ctctattcag gctgcagact agcctcactc aggccagaga aggatagctc agccatggca 

55 4 9741 gtggatgcca tctgcacaca tcgccctgac cctgaagacc tcggactgga cagagagcga 

4 9801 ctgtactggg agctgagcaa tctgacaaat ggcatccagg agctgggccc ctacaccctg 
49861 gaccggaaca gtctctatgt caatggtttc acccatcgaa gctctatgcc caccaccagc 
49921 actcctggga cctccacagt ggatgtggga acctcaggga ctccatcctc cagccccagc 
49981 cccacgactg ctggccctct cctgatacca ttcaccctca acttcaccat caccaacctg 

60 5 0 041 cagtatgggg aggacatggg tcaccctggc tccaggaagt tcaacaccac agagagggtc 

5 0101 ctgcagggtc tgcttggtcc catattcaag aacaccagtg ttggccctct gtactctggc 
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5 0161 tgcagactga cctctctcag gtctgagaag gatggagcag ccactggagt ggatgccatc 
5 0221 tgcatccatc atcttgaccc caaaagccct ggactcaaca gagagcggct gtactgggag 
5 0281 ctgagccaac tgaccaatgg catcaaagag ctgggcccct acaccctgga caggaacagt 
5 0341 ctctatgtca atggtttcac ccatcggacc tctgtgccca ccaccagcac tcctgggacc 
5 5 04 01 tccacagtgg accttggaac ctcagggact ccattctccc tcccaagccc cgcaactgct 

5 0461 ggccctctcc tggtgctgtt caccctcaac ttcaccatca ccaacctgaa gtatgaggag 
5 0521 gacatgcatc gccctggctc caggaagttc aacaccactg agagggtcct gcagactctg 
5 0581 cttggtccta tgttcaagaa caccagtgtt ggccttctgt actctggctg cagactgacc 
5 0641 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 

10 5 0701 cttgacccca aaagccctgg actgnacagn gagcngctnt actgggagct nagccanctg 

5 07 61 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 
5 0821 ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 
50881 ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 
5 0941 ntn cent tea ccntcaactt naccatcacc aacctgeant angngganna catgcnncnc 

15 51001 ccnggntcca ggaagttcaa caccacngag agagtccttc agggtctget caggcctgtg 

51061 ttcaagaaca ccagtgttgg ccctctgtac tetggctgea gactgacctt gctcaggccc 
51121 aagaaggatg gggcagccac caaagtggat gccatctgca cctaccgccc tgatcccaaa 
51181 agccctggac tggacagaga gcagctatac tgggagctga gccagctaac ccacagcatc 
51241 actgagctgg gcccctacac ccaggacagg gacagtctct atgtcaatgg cttcacccat 

20 513 01 eggagctctg tgccaaccac cagtattcct gggacctctg cagtgcacct ggaaaccact 

51361 gggactccat cctccttccc cggccacaca gagcctggcc ctctcctgat accattcact 
51421 ttcaacttta ccatcaccaa ectgegttat gaggaaaaca tgcaacaccc tggttccagg 
51481 aagttcaaca ecaeggagag ggttctgcag ggtctgetea cgcccttgtt caagaacacc 
51541 agtgttggcc ctctgtactc tggctgeaga ctgaccttgc tcagacctga gaagcaggag 

25 51601 gcagccactg gagtggacac catctgtacc caccgcgttg atcccatcgg acctggactg 

51661 gacagagagc ggctatactg ggagctgagc cagctgaeca acagcatcac agagctggga 
51721 ccctacaccc tggataggga cagtctctat gtcgatggct tcaacccttg gagctctgtg 
51781 ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 
51841 cccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 

30 51901 atcaccgacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 

51961 aeggagaggg ttctgcaggg tetgetcaag cccttgttca agagcaccag cgttggccct 
52021 ctgtactctg getgeagact gaccttgctc agacctgaga aacatggggc agccactgga 
52081 gtggacgcca tctgcaccct ccgccttgat cccactggtc ctggactgga cagagagegg 
52141 ctatactggg agetgageca gctgaccaac agcatcacag agctgggacc ctacaccctg 

35 522 01 gatagggaca gtctctatgt caatggcttc aacccttgga gctctgtgcc aaccaccagc 

522 61 actcctggga cctccacagt gcacctggca acetctggga ctccatcctc cctgcctggc 
52321 cacacaactg ctggccctct cctggtgccg ttcaccctca acttcaccat caccaacctg 
52381 aagtacgagg aggacatgea ttgccctggc tccaggaagt tcaacaccac agagagagtc 
52441 ctgcagagtc tgcatggtcc catgttcaag aacaccagtg ttggccctct gtactctggc 

40 525 01 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 

52561 tgcacccacc gtcttgaccc caaaagccct ggactgnaca gngagenget ntactgggag 
52621 ctnagccanc tgaccaannn catennngag ctgggnccct acaccctgga caggnacagt 
52 681 ctctatgtca atggtttcac ccatcnganc tctgngccca ccaccagcac tcctgggacc 
52 741 tccacagtgn aciitnggnac ctcngggact ccatcctccn tccccngccn cacancnnct 

45 52 801 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan 

52 8 61 nacatgenne ncccnggntc caggaagttc aacaccacng agngngtnet gcagggtctg 
52921 ctnrmncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 
52981 tnnctcaggn cngagaagna tggngcagee actggantgg atgccatctg canccaccnn 
5 3 041 cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 

50 53101 accaacagca tcacagagct gggaccctac accctggata gggacagtct ctatgtcaat 

53161 ggtttcaccc atcgaagctc tatgcccacc accagtattc ctgggacctc tgcagtgcac 
53221 ctggaaacct ctgggactcc agcctccctc cctggccaca cagcccctgg ccctctcctg 
53281 gtgecattea ccctcaactt cactatcacc aacctgeagt atgaggagga catgcgtcac 
5 3 341 cctggttcca ggaagttcaa caccacggag agagtcctgc agggtctget caagcccttg 

55 5 3401 ttcaagagca ccagtgttgg ccctctgtac tetggctgea gactgacctt gctcaggcct 

53461 gaaaaacgtg gggcagccac cggcgtggac accatctgca ctcaccgcct tgaccctcta 
5 3 521 aaccctggac tgnacagnga gengctntae tgggagctna gccanctgac caannncatc 
53581 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
53641 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

60 53701 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 

53 7 61 ntcaacttna ccatcaccaa ectgeantan gnggannaca tgcnncnccc nggntccagg 
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53821 
53881 
53941 
54001 
54061 
54121 
54181 
54241 
54301 
54361 
54421 
54481 
54541 
54601 
54661 
54721 
54781 
54841 
54901 
54961 
55021 
55081 
55141 
55201 
55261 
55321 
55381 
55441 
55501 
55561 
55621 
55681 
55741 
55801 
55861 
55921 
55981 
56041 
56101 
56161 
56221 
56281 
56341 
56401 
56461 
56521 
56581 
56641 
56701 
56761 
56821 
56881 
56941 
57001 
57061 
57121 
57181 
57241 
57301 
57361 
57421 



aagttcaaca 
agtgtnggcc 
gcagccactg 
nacagngagc 
ccctacaccc 
ccaaccacca 
tccctgcctg 
atcaccaacc 
acggagcggg 
ctgtactctg 
atggatgcca 
ctntactggg 
gacaggnaca 
actcctggga 
cncacancnn 
cantangngg 
ctgcagggtc 
tgcagactga 
tgcanccacc 
ctnagccanc 
ctctatgtca 
tccacagtgt 
ggccctctcc 
aacatgcaac 
ctcacgccct 
ttgctcagac 
gttgatccca 
accaannnca 
ggtttcaccc 
ntnggnacct 
ntnccnttca 
ccnggntcca 
ttcaagaacn 
gagaagnatg 
agncctggac 
nnngagctgg 
cggagctctg 
gggactccat 
ctcaacttta 
aagttcaaca 
agtgttggcc 
gcagccactg 
nacagngagc 
ccctacaccc 
cccaccacca 
tccntccccn 
atcaccaacc 
acngagngng 
ctgtactctg 
ntggatgcca 
ctntactggg 
gacaggnaca 
actcctggga 
cacacagccc 
cagtatgagg 
ctgcagggtc 
tgcagactga 
tgcctctacc 
ctaagccagc 
ctctatgtca 
tccacagtgt 



ccacngagng 
ntctgtactc 
gantggatgc 
ngctntactg 
tggacaggna 
gcactcctgg 
gccacacagc 
tgcattatga 
tcctgcaggg 
gctgcagact 
tctgcagcca 
agctnagcca 
gtctctatgt 
cctccacagt 
ctgnccctct 
annacatgcn 
tgctnnnncc 
cctnnctcag 
nncnt nance 
tgaccaannn 
atggtttcac 
actgggcaac 
tgataccatt 
accctggttc 
tgttcaagaa 
ctgagaagca 
teggacctgg 
tennngaget 
atcnganctc 
cngggactcc 
ccntcaactt 
ggaagttcaa 
ccagtgtngg 
gngcagccac 
tgnacagnga 
gnccctacac 
tgccaaccac 
cctccctgcc 
ccatcaccaa 
ecaeggagag 
ctctgtactc 
gagtggacgc 
ngctntactg 
tggacaggna 
gcactcctgg 
gccncacanc 
tgcantangn 
tnctgcaggg 
gctgcagact 
tctgcancca 
agctnagcca 
gtctctatgt 
cctccacagt 
ctgtccctct 
aggacatgea 
tgettagtec 
cctctctcag 
accctaatcc 
tgacccacaa 
atggtttcac 
actgggcaac 



ngtnctgcag 
tggctgeaga 
catctgcanc 
ggagctnagc 
cagtctctat 
gacctccaca 
ccctgtccct 
agaaaacatg 
tctgcttggt 
gaccttgctc 
ccgtcttgac 
nctgaccaan 
caatggtttc 
gnacntnggn 
cctgntnccn 
ncncccnggn 
cntnttcaag 
gncngagaag 
caaaagncct 
catennngag 
ccatcagaac 
cactgggact 
cactttcaac 
caggaagttc 
caccagtgtt 
ggaggcagee 
actgnacagn 
gggnccctac 
tgngcccacc 
atcctccntc 
naccatcacc 
caccacngag 
ccntctgtac 
tggantggat 
gengctntae 
cctggacagg 
cagcagtcct 
tggccacaca 
ectgeattat 
ggttctgcag 
tggctgeaga 
catctgcacc 
ggagctnagc 
cagtctctat 
gacctccaca 
nnctgnccct 
ggannacatg 
tctgctnnnn 
gacctnnctc 
cenncntnan 
nctgaccaan 
caatggtttc 
gcacctggca 
cttgatacca 
tcgccctgga 
cattttcaag 
geccgagaag 
caaaagacct 
catcactgag 
ccatcagaac 
cactgggact 



ggtctgctnn 
ctgacctnnc 
caccnncntn 
canctgacca 
gtcaatggtt 
gtgcacctgg 
ctcttgatac 
caacaccctg 
cccatgttca 
aggectgaga 
cccaaaagcc 
nncatcnnng 
acccatcnga 
acctcnggga 
ttcaccntca 
tccaggaagt 
aacnecagtg 
natggngcag 
ggactgnaca 
ctgggnccct 
tctgtgccca 
ccatcctcct 
tttaccatca 
aacaccacgg 
ggccctctgt 
actggagtgg 
gagengctnt 
accctggaca 
accagcactc 
cccngccnca 
aacctgeant 
ngngtnctgc 
tetggctgea 
gccatctgca 
tgggagctna 
nacagtctct 
gggacctcca 
gcccctgtcc 
gaagaaaaca 
ggtctgetea 
ctgaccttgc 
ctccgccttg 
canctgacca 
gtcaatggtt 
gtgnacntng 
ctcctgntnc 
cnncncccng 
cccntnttca 
aggncngaga 
cccaaaagnc 
nncatcnnng 
acccatcgga 
acctctggga 
ttcaccctca 
tctaggaagt 
aactccagtg 
gatggggcag 
gggctggaca 
ctgggcccct 
tctgtgccca 
ccatcctcct 



nncccntntt 
tcaggncnga 
ancccaaaag 
annncatenn 
ttcaccctcg 
caacctctgg 
cattcaccct 
gttccaggaa 
agaacacaag 
agaatggggc 
ctggactgna 
agctgggncc 
nctctgngcc 
ctccatcctc 
acttnaccat 
tcaacaccac 
tnggecntet 
ccactggant 
gngagenget 
acaccctgga 
ccaccagtac 
tccccggcca 
ccaacctgca 
agagggttct 
actctggctg 
acaccatctg 
actgggagct 
ggnacagtct 
ctgggacctc 
canennctgn 
angngganna 
agggtctget 
gactgacctn 
nccaccnncn 
gccanctgac 
atgtcaatgg 
cagtgcacct 
ctctcttgat 
tgcaacaccc 
agcccttgtt 
tcagacctga 
atcccactgg 
annncatenn 
tcacccatcn 
gnacctcngg 
cnttcaccnt 
gntccaggaa 
agaaenccag 
agnatggngc 
ctggactgna 
agctgggncc 
cctctgtgcc 
ctccatcctc 
actttaccat 
tcaacaccac 
ttggccctct 
caactggaat 
gagagcagct 
acagcctgga 
ccaccagtac 
tccccggcca 



caagaacncc 
gaagnatggn 
ncctggactg 
ngagctgggn 
gagctctgtg 
gactccatcc 
caactttacc 
gttcaacacc 
tgteggcett 
agccactgga 
cagngageng 
ctacaccctg 
caccaccagc 
cntccccngc 
caccaacctg 
ngagngngtn 
gtactctggc 
ggatgecate 
ntactgggag 
caggnacagt 
tcctgggacc 
cacagagcct 
ttatgaggaa 
gcagggtctg 
cagactgacc 
tacccaccgc 
nagecanctg 
ctatgtcaat 
cacagtgnac 
ccctctcctg 
catgcnncnc 
nnnncccntn 
nctcaggncn 
tnancccaaa 
caannncatc 
tttcacccat 
ggcaacctct 
accattcacc 
tggttccagg 
caagagcacc 
gaaacatggg 
tcctggactg 
ngagctgggn 
ganctctgng 
gactccatcc 
caacttnacc 
gttcaacacc 
tgtnggccnt 
agccactgga 
cagngageng 
ctacaccctg 
caccaccagc 
cctgcctggc 
caccaacctg 
agagagggtc 
gtactctggc 
ggatgctgtc 
gtactgegag 
cagggacagt 
tcctgggacc 
cacancnnct 
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58561 
20 5 8 621 



57481 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan 
57541 nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 
57601 ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 
57661 tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 
5 57721 cnt nance ca aaagncctgg actgnacagn gagengctnt actgggagct nagecanctg 

57781 accaannnca tennngaget gggnccctac accctggaca ggnacagtct ctatgtcaat 
57841 ggtttcaccc attggagctc tgggctcacc accagcactc cttggacttc cacagttgac 
57901 cttggaacct cagggactcc atcccccgtc cccagcccca caactgetgg ccctctcctg 
57961 gtgecattea ccctaaactt caccatcacc aacctgeagt atgaggagga catgcatcgc 
10 5 8 021 cctggatcta ggaagttcaa cgccacagag agggtcctgc agggtctget tagtcccata 

58081 ttcaagaaca ccagtgttgg ccctctgtac tetggctgea gactgacctt gctcagacct 
58141 gagaagcagg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 
582 01 ggacctggac tgnacagnga gengctntae tgggagctna gccanctgac caannncatc 
582 61 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
15 58321 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

58381 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 
5 8441 ntcaacttna ccatcaccaa ectgeantan gnggannaca tgcnncnccc nggntccagg 
58501 aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc 
agtgtnggcc ntctgtactc tggctgeaga ctgacctnnc tcaggncnga gaagnatggn 
gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 
5 8681 nacagngagc ngctntactg ggagctnagc canctgacca annncatenn ngagctgggn 
58741 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcg gagctttggg 
588 01 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 
58861 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct aaacttcacc 
25 58921 atcaccaacc tgcagtatga ggaggacatg catcgccctg gctccaggaa gttcaacacc 

58981 aeggagaggg tccttcaggg tetgettacg cccttgttca ggaacaccag tgtcagctct 
5 9 041 ctgtactctg gttgeagact gaccttgctc aggectgaga aggatggggc agccaccaga 
59101 gtggatgctg tctgcaccca tcgtcctgac cccaaaagcc ctggactgna cagngageng 
59161 ctntactggg agetnageca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
30 59221 gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 

5 9281 actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 
59341 cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg 
59401 cantangngg annacatgen ncncccnggn tccaggaagt tcaacaccac ngagngngtn 
59461 ctgcagggtc tgctnnnncc cntnttcaag aacnecagtg tnggecntet gtactctggc 
35 59521 tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgecate 

5 9581 tgcanccacc nncntnancc caaaagncct ggactgnaca gngagenget ntactgggag 
5 9 641 ctnagccanc tgaccaannn catennngag ctgggnccct acaccctgga caggnacagt 
59701 ctctatgtca atggtttcac ccattggatc cctgtgccca ccagcagcac tcctgggacc 

5 9761 tccacagtgg accttgggtc agggactcca tcctccctcc ccagccccac aactgetgge 
40 59821 cctctcctgg taccattcac cctcaacttc accatcacca acctgeagta tggggaggac 

59881 atgggtcacc ctggctccag gaagttcaac accacagaga gggtcctgea gggtctgett 
59941 ggtcccatat tcaagaacac cagtgttggc cctctgtact ctggctgcag actgacctct 
60001 ctcaggtccg agaaggatgg agcagccact ggagtggatg ccatctgcat ccatcatctt 
600 61 gaccccaaaa gccctggact gnacagngag cngctntact gggagctnag ccanctgacc 

45 60121 aannncatcn nngagctggg nccctacacc ctggacaggn acagtctcta tgtcaatggt 

60181 ttcacccatc nganctctgn gcccaccacc agcactcctg ggacctccac agtgnacntn 
60241 ggnacctcng ggactccatc ctccntcccc ngccncacan cnnctgnccc tctcctgntn 
60301 ccnttcaccn tcaacttnac catcaccaac ctgeantang nggannacat gcnncncccn 
603 61 ggntccagga agttcaacac cacngagngn gtnetgeagg gtctgctnnn ncccntnttc 

50 6 0421 aagaaencca gtgtnggccn tctgtactct ggctgeagae tgacctnnct caggncngag 

60481 aagnatggng cagccactgg antggatgee atctgcancc acenncntna ncccaaaagn 
60541 cctggactgn acagngagen gctntactgg gagctnagee anctgaccaa nnncatcnnn 
60601 gagctgggnc cctacaccct ggacaggnac agtctctatg tcaatggttt cacccatcag 
60661 acctttgcgc ccaacaccag cactcctggg acctccacag tggaccttgg gacctcaggg 

55 60721 actccatcct ccctccccag ccctacatct gctggccctc tcctggtgcc attcaccctc 

60781 aacttcacca tcaccaacct gcagtacgag gaggacatgc atcacccagg ctccaggaag 

6 0 841 ttcaacacca eggagegggt ectgeagggt ctgcttggtc ccatgttcaa gaacaccagt 
60901 gtcggccttc tgtactctgg ctgcagactg accttgctca ggcctgagaa gaatggggca 
60961 gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactgnac 

60 61021 agngagenge tntactggga getnagecan ctgaccaann neatennnga gctgggnccc 

61081 tacaccctgg acaggnacag tctctatgtc aatggtttca cccatcngan ctctgngccc 
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61141 
61201 
61261 
61321 
61381 
61441 
61501 
61561 
61621 
61681 
61741 
61801 
61861 
61921 
61981 
62041 
62101 
62161 
62221 
62281 
62341 
62401 
62461 
62521 
62581 
62641 
62701 
62761 
62821 
62881 
62941 
63001 
63061 
63121 
63181 
63241 
63301 
63361 
63421 
63481 
63541 
63601 
63661 
63721 
63781 
63841 
63901 
63961 
64021 
64081 
64141 
64201 
64261 
64321 
64381 
64441 
64501 
64561 
64621 
64681 
64741 



accaccagca 
ntccccngcc 
accaacctgc 
gagagggttc 
tactctggct 
gacgccatct 
tactgggagc 
agggacagtc 
cctgggacct 
acagcccctg 
tatgaggtgg 
cagggtctgc 
agactgacct 
actcaccgcc 
agcaaactga 
tatgtcaatg 
acagtacacc 
cctctcctgg 
atgcgacacc 
aggcccttgt 
ctcaggccag 
gaccctcaaa 
cacggcatca 
ttcactcatt 
ggaacctctg 
ccnttcaccn 
ggntccagga 
aagagcacca 
aaggacggag 
cctgggctag 
gagctgggac 
agctctgtgc 
actccatcat 
aattttacca 
ttcaacacca 
gtcagctctc 
gccaccagag 
agagagcggc 
tacaccctgg 
accaccagaa 
ctgtctggac 
actaacctgc 
gagagagtcc 
tactctggct 
gatgccatct 
tactgggagc 
agggacagtc 
cctgggaccc 
tcggctgcca 
tatgaggaga 
cagggcctgc 
agactgactt 
acccaccacc 
agccagctga 
tttgtcaatg 
acagtgtatc 
catctcctga 
atgtggcctg 
cccttgttca 
aggccagaga 
cccacaggcc 



ctcctgggac 
ncacagcccc 
attatgaaga 
tgcagggtct 
gcagactgac 
gcaccctccg 
tgagccagct 
tctatgtcaa 
ctgcagtgca 
gccctctcct 
acatgcgtca 
tcaagccctt 
tgctcaggcc 
ttgaccctct 
cccgtggcat 
gtttcaccca 
taggaacctc 
tgccattcac 
ctggctccag 
tcaagaatac 
agaaggacaa 
gccctggact 
ctgagctggg 
ggagccccat 
ggatcccacc 
tcaacttnac 
agttcaacac 
gtgttggccc 
tagccaccag 
acagacagca 
cctacaccct 
ccaccaccag 
ccctccctgg 
tcactaacct 
cggagagggt 
tgtactctgg 
tggatgctgt 
tgtactggaa 
acaggcacag 
ctcctgatac 
ctacgaccgc 
ggtatgagga 
ttcagggtct 
gcagactgac 
gcacctaccg 
tgagccagct 
tctatgtcaa 
ccacagtgga 
gccctctcct 
acatgcagca 
tcaggtccct 
tgctcaggcc 
ctgaccccaa 
cccacaatat 
gtttcactca 
tgggagcatc 
tactattcac 
gctccaggaa 
agaacaccag 
aagatgggga 
ctgggctgga 



ctccacagtg 
tgtccctctc 
aaacatgcaa 
gctcaagccc 
cttgctcaga 
ccttgatccc 
gaccaacagc 
tggcttcacc 
cctggaaacc 
ggtgccattc 
ccctggttcc 
gttcaagagc 
tgaaaaacgt 
aaaccctgga 
catcgagctg 
tcggaacttt 
tgaaactcca 
cctcaacttc 
gaagttcaat 
cagtatcggc 
ggcagccacc 
gaacagagag 
cccctacacc 
accgaccacc 
ttccctccct 
catcaccaac 
cacngagagg 
tctgtattct 
agtggacgcc 
gctatactgg 
ggatagggac 
cactcctggg 
ccccacagcc 
gcagtatgag 
ccttcagggt 
ttgcagactg 
ctgcacccat 
gctgagccag 
tctctatgtc 
ctccacaatg 
cagccctctc 
gaacatgcat 
gctcaggcct 
cttgctcagg 
ccctgatccc 
aacccacagc 
tggtttcaca 
cctgggaaca 
ggtgctattc 
ccctggctcc 
gttcaagagc 
tgaaaaggat 
aagccctagg 
cactgagctg 
tcggagctct 
taagactcca 
cctcaacttc 
gttcaacact 
tgttggccct 
agccaccgga 
cagagagcag 



nacntnggna 
ttgataccat 
caccctggtt 
ttgttcaaga 
cctgagaaac 
actggtcctg 
gttacagagc 
cagcggagct 
tctgggactc 
accctcaact 
aggaagttca 
accagtgttg 
ggggcagcca 
ctggacagag 
ggcccctacc 
gtgcccatca 
tcctccctac 
accatcacca 
accacggaga 
cctctgtact 
agagtggatg 
cagctgtact 
ctggacaggg 
agcactcctg 
gaaactacan 
ctgcantang 
gttctgcagg 
ggctgcagac 
atctgcaccc 
gagctgagcc 
agtctctatg 
actttcacag 
actggccctg 
gaggacatgc 
ctgcttatgc 
accttgctca 
cgtcctgacc 
ctgacccacg 
aatggtttca 
cacctggcaa 
ctggtgctat 
caccctggct 
gtgttcaaga 
cccaagaagg 
aaaagccctg 
atcactgagc 
cagcggagct 
tctgggactc 
actctcaact 
aggaagttca 
accagtgttg 
gggacagcca 
ctggacagag 
ggccactatg 
gtgtccacca 
gcctcgatat 
accatcacta 
acagagaggg 
ctgtactctg 
gtggatgcca 
ctgtatttgg 



cctcngggac 
tcaccctcaa 
ccaggaagtt 
gcaccagcgt 
atggggcagc 
gactggacag 
tgggccccta 
ctgtgccaac 
cagcctccct 
tcactatcac 
acaccacgga 
gccctctgta 
ccggcgtgga 
agcagctata 
tcctggacag 
ccagcactcc 
ctagacccat 
acttgcagta 
gggtcctaca 
ccagctgcag 
ccatctgtac 
gggagctgag 
acagtctcta 
ggacctccat 
cnnctgnccc 
nggannacat 
gtctgctcaa 
tgaccttgct 
accgccctga 
agctgaccca 
tcaatggttt 
tacagccgga 
tcctgctgcc 
atcgccctgg 
ccttgttcaa 
ggcctgagaa 
ccaaaagccc 
gcatcactga 
cccatcagag 
cctcgagaac 
tcacaattaa 
ctagaaagtt 
acaccagtgt 
atggggcagc 
gactggacag 
tgggccccta 
ctgtgcccac 
cagtttctaa 
tcaccatcac 
acaccacgga 
gccctctgta 
ctggagtgga 
agcagctgta 
ccctggacaa 
ccagcactcc 
ttggcccttc 
acctgcggta 
tccttcaggg 
gctccaggct 
tctgcaccca 
agctgagcca 



tccatcctcc 
ctttaccatc 
caacaccacg 
tggccctctg 
cactggagtg 
agagcggcta 
caccctggac 
caccagtatt 
ccctggccac 
caacctgcag 
gagagtcctg 
ctctggctgc 
caccatctgc 
ctgggagctg 
aggcagtctc 
tgggacctcc 
agtgcctggc 
tgaggaggcc 
gggtctgctc 
actgaccttg 
ccaccaccct 
ccagctgacc 
tgtcgatggt 
agtgaacctg 
tctcctgntn 
gcnncncccn 
gcccttgttc 
caggcctgag 
ccccaaaatc 
cagcatcact 
cacccagcgg 
aacctctgag 
attcaccctc 
ctccaggaag 
gaacaccagt 
ggatggggca 
tggactggac 
gctgggcccc 
ctctatgacg 
tccagcctcc 
cttcaccatc 
taacaccacg 
tggccctctg 
caccaaagtg 
agagcagcta 
caccctggac 
cactagcatt 
acctggtccc 
caacctgcgg 
gagggtcctt 
ctctggctgc 
tgccatctgc 
ttgggagctg 
cgacagcctc 
tgggaccccc 
agctgccagc 

tgaggagaac 

cctgctaagg 
gaccttgctc 
ccgccctgac 
gctgacccac 
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648 01 agcatcactg agctgggccc ctacacactg gacagggaca gtctctatgt caatggtttc 
648 61 acccatcgga gctctgtacc caccaccagc accggggtgg tcagcgagga gccattcaca 
64921 ctgaacttca ccatcaacaa cctgcgctac atggcggaca tgggccaacc cggctccctc 
64981 aagttcaaca tcacagacaa cgtcatgaag cacctgctca gtcctttgtt ccagaggagc 
5 65 041 agcctgggtg cacggtacac aggctgcagg gtcatcgcac taaggtctgt gaagaacggt 

65101 gctgagacac gggtggacct cctctgcacc tacctgcagc ccctcagcgg cccaggtctg 
65161 cctatcaagc aggtgttcca tgagctgagc cagcagaccc atggcatcac ccggctgggc 
65221 ccctactctc tggacaaaga cagcctctac cttaacggtt acaatgaacc tggtctagat 
65281 gagcctccta caactcccaa gccagccacc acattcctgc ctcctctgtc agaagccaca 
10 65341 acagccatgg ggtaccacct gaagaccctc acactcaact tcaccatctc caatctccag 

65401 tattcaccag atatgggcaa gggctcagct acattcaact ccaccgaggg ggtccttcag 
65461 cacctgctca gacccttgtt ccagaagagc agcatgggcc ccttctactt gggttgccaa 
65521 ctgatctccc tcaggcctga gaaggatggg gcagccactg gtgtggacac cacctgcacc 
65581 taccaccctg accctgtggg ccccgggctg gacatacagc agctttactg ggagctgagt 

15 65641 cagctgaccc atggtgtcac ccaactgggc ttctatgtcc tggacaggga tagcctcttc 

65701 atcaatggct atgcacccca gaatttatca atccggggcg agtaccagat aaatttccac 
65761 attgtcaact ggaacctcag taatccagac cccacatcct cagagtacat caccctgctg 
65 821 agggacatcc aggacaaggt caccacactc tacaaaggca gtcaactaca tgacacattc 
65881 cgcttctgcc tggtcaccaa cttgacgatg gactccgtgt tggtcactgt caaggcattg 

20 65941 ttctcctcca atttggaccc cagcctggtg gagcaagtct ttctagataa gaccctgaat 

66001 gcctcattcc attggctggg ctccacctac cagttggtgg acatccatgt gacagaaatg 
66061 gagtcatcag tttatcaacc aacaagcagc tccagcaccc agcacttcta cctgaatttc 
66121 accatcacca acctaccata ttcccaggac aaagcccagc caggcaccac caattaccag 
66181 aggaacaaaa ggaatattga ggatgcgctc aaccaactct tccgaaacag cagcatcaag 

25 66241 agttattttt ctgactgtca agtttcaaca ttcaggtctg tccccaacag gcaccacacc 

6 63 01 ggggtggact ccctgtgtaa cttctcgcca ctggctcgga gagtagacag agttgccatc 
663 61 tatgaggaat ttctgcggat gacccggaat ggtacccagc tgcagaactt caccctggac 
66421 aggagcagtg tccttgtgga tgggtattct cccaacagaa atgagccctt aactgggaat 
66481 tctgaccttc ccttctgggc tgtcatcctc atcggcttgg caggactcct gggactcatc 

30 66541 acatgcctga tctgcggtgt cctggtgacc acccgccggc ggaagaagga aggagaatac 

6 6601 aacgtccagc aacagtgccc aggctactac cagtcacacc tagacctgga ggatctgcaa 
66661 tgactggaac ttgccggtgc ctggggtgcc tttcccccag ccagggtcca aagaagcttg 
66721 gctggggcag aaataaacca tattggtcgg aaaaaaaaaa aaaaa 

35 SEQ ID NO. 3 

hk5 amino acid 

MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDSSSRIINGSD 
CDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKICVFRVRLGHYSLSPVYESGQQMFQGVKSIPHPG 
40 YSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLSQKR 
CEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPVVCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQET 
IQANS 



45 



SEQ ID NO. 4 
KLK5 CDS 



50 



55 



ggtgtctgtg 
tgctagactc 
gcccgtcctt 
aagacccccc 
agagcatgtt 
tgggagcaac 
cagccgcatc 
gttgctaagg 
cacggccgcc 
accagtttat 
ctactcccac 
tcccactaaa 



cgtcctgcac 
ctatcttctg 
gtggttcctc 
tggatgtggg 
ctcgccaaca 
caggacctgg 
atcaatggat 
cccaaccagc 
cactgcagga 
gaatctgggc 
cctggccact 
gatgtcagac 



ccacatcttt 
aattctatag 
tctacttggg 
tgctctgtgc 
atgatgtttc 
gagctggggc 
ccgactgcga 
tctactgcgg 
agaaagtttt 
agcagatgtt 
ctaacgacct 
ccatcaacgt 



ctctgtcccc 
tgcctgggtc 
gaaatcaggt 
tctgatcaca 
ctgtgaccac 
cggggaagac 
tatgcacacc 
ggcggtgttg 
cagagtccgt 
ccagggggtc 
catgctcatc 
ctcctctcat 



tccttgccct 
tcagcgcagt 
gcagcggcca 
gccttgcttc 
ccctctaaca 
gcccggtcgg 
cagccgtggc 
gtgcatccac 
ctcggccact 
aaatccatcc 
aaactgaaca 
tgtccctctg 



gtctggaggc 
gccgatggtg 
tggctacagc 
tgggggtcac 
ccgtgccctc 
atgacagcag 
aggccgcgct 
agtggctgct 
actccctgtc 
cccaccctgg 
gaagaattcg 
ctgggacaaa 
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gtgcttggtg 
ccagtgcttg 
gatagatgac 
ttctgggggg 
cccttgtgcc 
gatccaggaa 
cccacctgct 
gttgagaatg 
cccccacatt 
tccagggcgg 
catcccttct 
aaaaaaa 



tctggctggg 
aatatcagcg 
accatgttct 
cctgtggtct 
cggcccaaca 
accatccagg 
gcagggacag 
ttcatctctc 
gggctgaccg 
gggttgcgtc 
ctgcagctct 



ggacaaccaa 
tgctaagtca 
gcgccggtga 
gcaatggctc 
gaccgggtgt 
ccaactcctg 
ccctgacact 
cagcccctga 
tgtctctcta 
tcaatctccc 
gacccaaatt 



gagcccccaa 
gaaaaggtgc 
caaagcaggt 
cctgcaggga 
ctacacgaac 
agtcatccca 
cctttcagac 
ccccatgtct 
gttgaaccct 
tggggcactt 
tagtcccaga 



gtgcacttcc 
gaggatgctt 
agagactcct 
ctcgtgtcct 
ctctgcaagt 
ggactcagca 
cctcattcct 
cctggactca 
gggaacaatt 
tcatcctcaa 
aataaactga 



ctaaggtcct 
acccgagaca 
gccagggtga 
ggggagatta 
tcaccaagtg 
caccggcatc 
tcccagagat 
gggtctgctt 
tccaaaactg 
gctcagggcc 
gaagtggaaa 



SEQ ID NO. 5 

KLK5 nucleic acid 



gggcccagag tgaaggcaag agaaggagtt 
tcccctgcct aaaatgcagg gagagggagg 
aagaaagaga gagagagaga gagacagaat 
acagagagcc tgggacacag ggacacacag 
acacaaatgg agacacagag gtgtaaagaa 
aaaggggcag aagcacagtt ttcagggtgg 
tttttttttt tttttgagac ggagtctcgc 
gatctcggct cactgcaagc tccgcctccc 
ccaagtagct gggactacag gcgcccgcca 
gtagagacgg ggtttcaccg ttttagccgg 
gcccgcctcg gcctcccaaa gtgctgggat 
atcatcttct tgactatgct gatgtgacaa 
aatatgcagt ttgggccagg caccgtggct 
agaggtgggt gaatcacttg aggccaggag 
ctctgtcttt actaaaaaaa aaaaaaaaaa 
acctgtaatc ccagctatgc tggaggctga 
gaggttgcag tgggccgaga tcacatcacc 
ctgtctcaaa taaataaata aacaaacgaa 
aaaaaaaaaa tgctgtcaac aaatagagca 
agaactctaa ggtatatttg acaaatcatt 
ggcatagaaa gacagggagg aacagggaga 
acaaggctcc taagacagac aggaggagag 
aaaaagacag agagagagag acagagacag 
gagagagggg tggagagaga cacgagatat 
gaaccacaga gagatggaag aagactctga 
agtatcgagg gtgaacagac agtggtggaa 
tccaggcgcc aagaatagtg acccagagtt 
aggcagggaa ggggctggcc tggcttccgg 
gtagggagtg acattccgga ctgggtgggg 
ggaggagcta ttgctaaggc ccgataggca 
cagtgggtgg ttataactca ggcccggtgc 
aggcacaggc ctgagaagtc tgcggctgag 
gggacagggc aagtgagacc tggtgagggt 
gtgcgtcctg cacccacatc tttctctgtc 
ctcctatctt ctgaattcta tagtgcctgg 
cttgtggttc ctctctacct ggggaaataa 
gctccccgga tcgcctgggc ctcccaaccc 
atggctacag caagaccccc ctggatgtgg 
ctgggggtca caggtaacca gaactctggg 
ctctgcggca ctagagcgcc tgtcccctgg 
gaccgggtga atgtgagtct ctgtctgtac 



gagagctccc tctgcaaagt ggcttgagtc 
cagaaagaca gggaagagga aggggtgggg 
aacacaacta cagaaacaca gagagaacac 
agtcagagag aaaagagaag atagagaaag 
agagagatta acagagtccc agatacacgc 
tgtctatgat catcttcttt tttttttttt 
tctgtcgccc aggctggagt gcagtggcgg 
gggttcacgc cattctcctg cctcagcctc 
ctacgcccgg ctaatttttt tgtattttta 
gatggcctcg atctcctgac ctcgtgatcc 
tacaggcgtg agccaccgcg cccggccatg 
gtacctaaag ccatcagact ctacccttta 
catgcctgta attccagcac tttgggaggc 
tttgagacca gcctggccaa catggtgaaa 
aaaaaaaatc agccgggtgt cgtggggcac 
ggcacgagag tcacttgaac cctggaggcg 
gccctccagc ctgggcgaca gagcaagact 
caagcagttt gttgtacctt agttatatct 
gaagtgaaat aaaggaaaat aaatgggcca 
cagaaccttt aaaaaagaaa gaatcacaga 
cagaaacacc tgtggcccaa ggagaacaaa 
agagagagag tgagtgagag acagacagag 
agagacagag aggcgagagg gatagaaaga 
tgagagagac tcagaaagat agccgaggga 
gaaaaaacca gagacaaaga tggaaagagg 
tgagcaaaat gcagagaaga aagcaagcaa 
ggtgagaagc cagatcctta aggctggggg 
agacccctcc ccattctccg ggccagggag 
ggtgctctgg gggtggagat agggggagca 
cctcattgcc cgggaatgtg ccccagggag 
ccagagccca ggaggaggca gtggccagga 
ctgggagcaa atcccccacc ccctacctgg 
ggctcagcag gcagggaagg agaggtgtct 
ccctccttgc cctgtctgga ggctgctaga 
gtctcagcgc agtgccgatg gtggcccgtc 
ggtaggggag ggaggggaag tgggttaagg 
tctgacattc cccatccagg tgcagcggcc 
gtgctctgtg ctctgatcac agccttgctt 
gtgggagggt tgtgggattg ggaggactgt 
ggaactgtgt gagcctgggc atgactccgg 
ttgtggttgt gcgatcgtat gtggccctgt 



WO 2004/075713 



27/47 



PCT/CA2004/000281 



gactgccacg gtgtgtgtcg gggaggggga 
caggtggcac tgaccctttg aggctgtgtg 
attgtgtgtg gctccacagc tgtgtgggtg 
gtgtttggct gtgtgtggtg acttggcatt 
5 tccctgaggt cccgggattg cgtgcaacaa 
gtgtgctgct tgcaggcgat tatgtgattg 
tttgtgaccg tgtgactacc tgaagctctg 
tctgtgtgag gccgtgtaaa tgctactgta 
ttctgtctct gcctggaggg atagagggtg 
10 caggtgactg acttgcagtg tgtgcctgtg 
tgtgcacaca cggcatctgt gcgtggcact 
gctaggctgc ccgggagcgt gtgtacctgg 
gaggcaacat gggcgtgtct gcagaactgc 
gcgtggttct tggggtgagt tcgtgaatga 
15 gaaccaggcc gggcgcggtg gctcacgcct 
ggcggatcac ctgaggtcgg gagatcgagg 
ctactaaaaa tacaaaaaat tagctggtgt 
gggagactgg ggcagaaaaa tcgcttgaac 
tcgcgccatt gcactccagc ctgggcaaca 
20 gaaaaaaaaa agggtaagaa ccagtgaatg 
atgcatgtag tctgtaggtc tgtgtgtgag 
gttttcatct gagaattcag aaacctaggc 
ctgagccctt ctttcctggt cctgctttcg 
ccacctcctt tcctcaacca cgcccctagg 
25 cacccctttg ggccaggctc caccccctat 
agtcagagct tttttttttt tttttttgga 
tgcagtggcg tgatctcggc tcactgcaac 
gcctccacct cctgagtagc tgggattaca 
gtgtctttag tagagacagg gtttcacctt 
30 tcaggtgatc cgcccacctc ggcctcccag 
ccccagccca aagtcagagc tctttatagg 
ctaactaagt caattccaaa ccccttcctg 
tgaccccact tcttgagacc agttccatcc 
gctccagccc ccacagcttt ggcactaccc 
35 tttaccctca catgtagttc tagccaattc 
gtaaccctac ctgagcctgg gctctgtcct 
ctcttattct ccaggccctg cccctgcccc 
ggtctggcct cttgagtctg aaacccaccc 
caacccattt tccgttccca gagcatgttc 
40 cctctaacac cgtgccctct gggagcaacc 
cccggtcgga tgacagcagc agccgcatca 
agccgtggca ggccgcgctg ttgctaaggc 
tgcatccaca gtggctgctc acggccgccc 
gaggagggtt ggtggggacg gggaagtggg 
45 tcatggaggt gagggctggt ggggacgggg 
gggttggtgg ggatgggttg gggatgtggg 
taaggatgga gttttgcggg ggagcaaggt 
tgtggtaggg aatgggaagg agccaaggat 
tgttgaatgg tttgggatgg aggtggaatt 
50 aaatcgggct ggggtggaaa tgaagatagc 
atagaatgaa ggatggggat tggagttttg 
tgagaatgca tatggtgatg gcttctgggt 
ggtttggaat tgtgactggg atggggacag 
ggatggtttg gggaccgggg gtggggatgg 
55 gattggcgtt ggacgtggag atagagatca 
agagttttca gagtccgtct cggccactac 
cagatgttcc agggggtcaa atccatcccc 
aacgacctca tgctcatcaa actgaacaga 
atcaacgtct cctctcattg tccctctgct 



tgccttttcc catatcaggt gactgtgcgg 
tgtggttttg tgattgtgtg tgcatttaag 
aatgcatgta gcactggggg tgttcactgt 
gtatatgact gcaggtatct gcagttcctg 
aagtggtcat caccatggaa agctgtgact 
tggctgagtg tgacgttatg gatgcccgta 
tgtaggggtg actgtatgtg actgtgtgtg 
tgtgtgatgg tgcagctgtg tgtctggagt 
caggggtagc tatctctggg agatgggtgc 
tgcagaagag tatgtggcag tctgaacatc 
gagacactgt ggatgagggt gtgcgatccc 
agacagagct gtatgttagc tgcacctgtg 
gtgcgtgctt ggctgttact gctgttgtgc 
tggtggtgcc agggccatca gcaagggtaa 
gtaatcccag ccctttggga ggccgaggca 
ccagcctgac caacatggag aaccccgtct 
ggtggcgcgt gcctgtaatc ccagctactc 
ccgggaggtg gaggttgcgg tgagccgaga 
agagcgaaac tccgtctcga aagaaaaaaa 
ggcacgggag gactgatgat ggagtggggc 
aggaggagat tgacaggatt gagaaggcat 
ctgctcttcc cctccatgtg gccccctaag 
gaaccctagc tccgcccatg agctctgacc 
ccagactcta gtggaccccg cctaaggcca 
tctgtgggta ccttctagaa cccccttcaa 
gacagtcttg ctctctctcc caggctggag 
ctctgcctcc caggttcaag tgattctcgt 
ggtgcgcgcc accacgcctg gctaattttt 
gttggccagg ctggtctcaa actcccaacc 
agtgctgggg ttacaggcgt gagccaccgc 
agactctaac atgtaaccct gaccctggcc 
cctccagccc tgaccccact cactgaggcc 
ctaaagccct ggtctccctc ccatccccag 
ctgagcttgt ccaggaatcc tgtacccaat 
caggaatctg tgaggtccag ttagagtcca 
tgagcttgag cctgggcttg agaggtgcca 
ctcagcatgt cagacaccca ccctctagct 
ccagcccaag ccccgcctct gagccccgcc 
tcgccaacaa tgatgtttcc tgtgaccacc 
aggacctggg agctggggcc ggggaagacg 
tcaatggatc cgactgcgat atgcacaccc 
ccaaccagct ctactgcggg gcggtgttgg 
actgcaggaa gaagtgagtg ggagttccaa 
ggtgggggtg gggaagtggg ggtgggggtg 
aagtggggtt gggggtgtca tggaaggtga 
agcaggagga ggtcgagttg gggataggac 
gggaggatga ggttggagag gggagagtgt 
gggttggatt tggggttagg agcatatatt 
gggattggct ttagaattgg gggtgggtga 
atggagatag ggttgagatt gggagcagat 
ggtggggttg gagatggttg gatttgggct 
agggaaagaa ttagggttgg gaatgggatg 
gcatgggatt ggagaccaag agggagttga 
gggtggggct ggggctgggt gtggggttgg 
gggttggtgg tgacctgccc catcttcctc 
tccctgtcac cagtttatga atctgggcag 
caccctggct actcccaccc tggccactct 
agaattcgtc ccactaaaga tgtcagaccc 
gggacaaagt gcttggtgtc tggctggggg 
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acaaccaaga gcccccaagg tgagtgtcca ggttcttctt gataccgacc catctctgcc 

gccttccatc tttctccact tctcattgtg ttcctgtttg acagtgcact tccctaaggt 

cctccagtgc ttgaatatca gcgtgctaag tcagaaaagg tgcgaggatg cttacccgag 

acagatagat gacaccatgt tctgcgccgg tgacaaagca ggtagagact cctgccaggt 

5 gaggacacct ctctttattc agcagataca cactgagtgc caactcggta acatggagcg 

ttgccaaatt ctgagaatcc agcaattgcc aagacagtca ggacccctgt tctcacagag 

ctcataccct agagtagtgg tgtttagtag aaataatgct gagctgctta tgtcatttcc 

agttttttag tagccacatt aaaacaggta aaaaaggctg ggcgcagtgg ctcacacctg 

taatcccagc actttgggag gctgaggcag gcagatcacc tttggtcagg agtttgagac 

10 tagcctggcc aacatggcga aactctgtct ctaaaaaaaa atacaaaaat tagcctggca 

tggtggcggg cgcctgtaat ctcagctgct caggaggccg agacacaaga atcacttaaa 

cccaggaggt ggaggttgca gtgagctgag atcgtgccac tcactccaac ctgggagaca 

gagtgacact tttgtctcaa aaagaaaaaa aaaaacaagt aaaaaagaaa caggtgaagt 

taactttaat aacccaatgt atcccaaata caatcatttc aaagtgtaat taatataaaa 

15 caattatgaa tgagatactt tacattcttt tcttgttttc atattaagtc tttgaaagtg 

agtatatatg ttatgctgac agcacatctc aatttggact agctacattt caggtgctca 

gtagccacat gtggctagca gttactgtat tggatggcac ggatctagag ggaaagatca 

gggctgtttt gtatggttgg gcaggttgtg cactgcataa agataccata tctaataggg 

gcactccgtg ttacagatgt cagttttggc agttttcagg cgtgtggtag ttaagtgtct 

20 tgtttcaaca aaatctgtaa tatgacagtt ttctagcaag tgctggtaaa atatcttgag 

gaaggaaaag agaaatctgg taggtatttt tacaagagaa tatttaatac aggggattaa 

ttgcaaagct gctggaaggg ctggaggaac aaagttaaaa aataaaaaac tctgtggtca 

agaatctgca taaatagggc aatttcagag agtggtaaag gttaacccca aaataaaaca 

tggttttagg atagtaaaca ataagggcca atattcaaaa aggtggtcag gggagcctcc 

25 ttggagaggt ggcatttgag cagagaatgg atgacacaaa gaagctaaac tcgtgaagtt 

taaggggaaa gaaaaggcac gtgcaaaggc cctgaggcag taaggaattt ggctgattca 

aagaagaaga ggaaaccaat gcaactggag aacaaaagtg ggggcaacag tagaaagtga 

cgctggaggt gtaggcaggg gcgaatgctc tgcaagtatt tcttggtcac caacacagag 

cttccctatg ttctaatgga agctgtatct gttgaggaag acagaattta aaatcaaact 

30 gttacatcaa ccagcaccct tctctgtatt caggctccca agggatctag aaggacgtaa 

gttaacaagc tctcattagc agggtgtgtg tttcaacagt agttaggaag ctggggattc 

aggagtactc cagtcccatg gctatgaaaa gctcccccca aattgtacaa acctgacaaa 

tgcaacacct ccccagctct ccccatttct tctctgtgcc ctgggtgtgg gggggtgggt 

tgcgaggggg aaaactttta acagaagaaa gcacatctcg gccgggcgtg gtggctcaca 

35 cctgtaatcc caacactttg ggaggccgag gcgggtggat cactaggtca ggagatggag 

accatcctgg ctgacacggt gaaaccctgt ctctactaaa aacacaaaaa attagccggg 

cgtggtggca ggcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggcctg 

aacccgggag gcggaacttg cagtgagccg aggttgcacc actgcactcc agcctgggca 

acacagtgag actccgtctc aaaaaaaaaa aaagaaaaga aaagaaatca catctcattc 

40 aagtggtggc atttaaaact atttagcctt tctgtaggca aggttagtat cttgtttttc 

cagacctcaa ggtgtttttt tgtttgtttt ttcataccgg tgtgtggtct gggtgtggcc 

actaaaagct acaagcaaga aataataaca actacaacaa tactaatacc aatagtataa 

aaataatagc atctggctaa ttgctggaca ctgttttaag tggtttgcat gcctcagctc 

attaactcat ttacctgtta ttattggccc tattttacaa acaaggagcc aaggctcaga 

45 gcagttaact aacagcctct caaaagaaac tctgcagaga tattaaattt aaaaaataat 

gagagaaatt aaaccacaag aaagttgaaa tttagaggta caggcagcta agcttgtttg 

ctttgaaaca gtgtctgcta ctgggaaaaa ggcaagtctt ggctttccta ataattgata 

ccaggactct gtaattcata ttttgcatgc atgtaagtaa gaaatgaagc cgggtgcaat 

ggcacatgcc agtaatccca gcactctggg agactgaagt gggaagatca cttgagctca 

50 ggagttcaag accagcctgg gcaactaaaa attaaaaaaa taaaaatact aattgttttt 

attttagtag attttattca taccacttac atcattattg tagtatgtac atatttattt 

cttttctttt cttttctttt cttttttgag acggagtctc gctctgtcac ccaggctgga 

gtgcaatggc accatatcag ctcactgcag catgcgcctc ctgggttcaa gcatttcttc 

cacctcagcc tcccaagtag ctgggataac aggcacccac caccatgcct ggctattttt 

55 ttttttccgt agagatgggg ttccaccatg ttggccaggc tggtcttgaa ctcctgacct 

ccagtgatct gcctgcctcg gcctcccaaa ttgctggtat tacaggtgtg agccaccgtg 

cccaggtggg agatagacat ttctctctac ctcaaacaga ggtccactca agctactttt 

cattttcttc ataaatatta gccgagtggc tattttgcac caggaatggt tccaggtgct 

gtggatatgg catcaggcaa aacagaccaa aaacttcctg ccgcgtggac ctcatgttcc 
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ccaagtggaa 
aattagccgg 
agaattgctt 
cagacagcag 
5 gaagggagga 
gaaaggaagg 
aaagaagaaa 
gttgaagagc 
gacaggaaac 

10 gaagacactg 
ctggcaacta 
gcctccacaa 
tcatccaggg 
gaatggggaa 

15 agagtgagag 
agtagagacg 
gtggacaaca 
gttgaacttg 
aggtgggcga 

20 ccgtctctac 
cagctacttg 
gagctgaggt 
aaaaaaaagc 
tgtgtctcca 

25 gtctgttccc 
cccactttgc 
ggggggcctg 
tgtgcccggc 
caggaaacca 

30 cctgctgcag 
agaatgttca 
cacattgggc 
gggcgggggt 
ccttctctgc 

35 



gacaggcaat 
gtgtggtggc 
gagcccaaac 
cctgggtgac 
aggagagaga 
aaggaaggag 
gaaaagagag 
agtgagtatt 
acaaattagt 
gcaataccag 
atggatagat 
caaaccatta 
gctgagaacc 
ggaaggtgtg 
agaaagggag 
aagattcgag 
ggaatctgga 
gaggccaggc 
atcacttgag 
aaaaaaaata 
ggaggctgag 
caggccactg 
tggatttgga 
tctctgtctt 
atctgcttct 
cacatctctg 
tggtctgcaa 
ccaacagacc 
tccaggccaa 
ggacagccct 
tctctccagc 
tgaccgtgtc 
tgcgtctcaa 
agctctgacc 



aaagagatag 
ttgcacctgt 
gtttgaggct 
aaagcaagac 
aaggaaggaa 
aaagaaagga 
aggaaggaag 
attataggag 
ccaagcgaat 
gagacatttt 
gccaagtgtg 
tccagcttca 
gtatttttgc 
tgtccagtaa 
gcggagagat 
atgtggagag 
agaggaagat 
atggtggctc 
gccaggagtt 
caaaaaatta 
gcaggagaat 
cgctccaacc 
gtgaaatatt 
tttctgcatt 
ccatctatgg 
cctctctcat 
tggctccctg 
gggtgtctac 
ctcctgagtc 
gacactcctt 
ccctgacccc 
tctctagttg 
tctccctggg 
caaatttagt 



ataaatatgt 
agttccagct 
gcggtaagcc 
gtttttgtca 
ggaaggagaa 
agaaagagaa 
gaaagaagga 
ggtaattata 
ggatttctat 
tggttgtcac 
ctgttcaaca 
gatgcccaca 
agaagggagg 
gagaaataag 
acacgatgag 
gaagggtcac 
ggagtggaga 
acgcctgtaa 
cgagaccagc 
gccgggtgtg 
tgcttgaacc 
tgggcaacag 
aataacattc 
tcttcatctc 
gcatctctgg 
gccccccttt 
cagggactcg 
acgaacctct 
atcccaggac 
tcagaccctc 
atgtctcctg 
aaccctggga 
gcactttcat 
cccagaaata 



agtaaattaa 
acttgggagg 
atgactgcac 
gaaagaaaaa 
agaaaggaag 
agaaagaaaa 
aaagagggaa 
gggaggtatg 
tgggagtgat 
aactatatgg 
tgctatgatg 
gtgcccagat 
tataaggatg 
gcctgcacag 
ggagacaggc 
agaccccccc 
gtgacaaatg 
tcccaacact 
ctggccaaca 
gtgatggaca 
cgggagatgg 
agtaagactc 
tccctctctc 
tgtactttcc 
gtctctcatg 
ctctcctgca 
tgtcctgggg 
gcaagttcac 
tcagcacacc 
attccttccc 
gactcagggt 
acaatttcca 
cctcaagctc 
aactgagaag 



aaaaaaaaaa 
ctgaggtggg 
tgctgcactc 
aaaaagagac 
gaaggagaaa 
agaaagaaag 
aaaaatgact 
gggaattgaa 
tctgccccta 

aggggcatta 

cacacggcag 
cgaggaaccc 
ggttggtgga 
gctggagggg 
tggaacagaa 
gaaatgatgt 
gggtctaaag 
ttggaggctg 
tggtgaaacc 
cctgtagtca 
aggctgcagt 
catctcaaaa 
tccttttgcc 
atctctgtgt 
tctccttctg 
gggtgattct 
agattaccct 
caagtggatc 
ggcatcccca 
agagatgttg 
ctgcttcccc 
aaactgtcca 
agggcccatc 



SEQ ID NO. 6 



40 



45 



50 



55 



hk6 amino acid 

MKKLMVVLSLIAAAWAEEQNKLVHGGPCDKTSHPYQAALYTSGHLLCGGVLIHPLWVLTAAHCKKPNLQVFL 
GKHNLRQRESSQEQSSVVRAVIHPDYDAASHDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHILGWG 
KTADGDFPDTIQCAYIHLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNI 
PC GSKEKPGVYTNVCRYTNWIQKTIQAK 

SEQ ID NO. 7 
KLK6 nucleic acid 



CDS 147. 

gtcgacccac 
cagcacacag 
cccggaggcc 
ctgcagcctg 
ctcaccccta 
tccatccact 
tggggaagca 
ctgtgatcca 
tggcacgccc 



881 

gcgtccggct 
agggacctac 
tgcagcagga 
ggcagaggag 
ccaagctgcc 
gtgggtcctc 
taaccttcgg 
ccctgactat 
agccaaactc 



ggctggctcg 
gggcagctgt 
gcggccatga 
cagaataagt 
ctctacacct 
acagctgccc 
caaagggaga 
gatgccgcca 
tctgaactca 



ctctctcctg 
tccttccccc 
agaagctgat 
tggtgcatgg 
cgggccactt 
actgcaaaaa 
gttcccagga 
gccatgacca 
tccagcccct 



gggacacaga 
gactcaagaa 
ggtggtgctg 
cggaccctgc 
gctctgtggt 
accgaatctt 
gcagagttct 
ggacatcatg 
tcccctggag 



ggtcggcagg 
tccccggagg 
agtctgattg 
gacaagacat 

ggggtcctta 

caggtcttcc 
gttgtccggg 
ctgttgcgcc 
agggactgct 
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cagccaacac 
ctgacaccat 
accctggcca 
cctgccaggg 
catggggtaa 
gatacacgaa 
cctcccgacc 
ccctcctctc 
tgattctccc 
gaggacttgg 
ggcatctcct 
gtgtctggaa 
cactgcagat 
ggctgttgga 
ccatgcactc 
agggcggccg 



caccagctgc 
ccagtgtgca 
gatcacccag 
tgattctggg 
catcccctgt 
ctggatccaa 
taccacccca 
ctgcccagct 
tggttttacc 
gtcctcggtc 
ctccccaacc 
tcccagctcc 
ttctcacctg 
aagatttaag 
aataaagaat 
c 



cacatcctgg 
tacatccacc 
aacatgttgt 
ggtccgctgg 
ggatcaaagg 
aaaaccattc 
ctggctggtt 
ctgaccctga 
ccagctccat 
ttacccccac 
cttccacacg 
gctgcttact 
taagatgaag 
gtttcacacc 
gaattttatt 



gctggggcaa 
tggtgtcccg 
gtgctgggga 
tatgtggaga 
agaagccagg 
aggccaagtg 
ccagaacgtc 
tgcttaataa 
ccttgcatca 
cactaagaga 
tttgatttct 
gtcggtgtcc 
ataaggatga 
tatgacatac 
atgaaaaaaa 



gacagcagat 
tgaggagtgt 
tgagaagtac 
ccacctccga 
agtctacacc 
accctgacat 
tctcacctag 
acgcagcgac 
ctggggagga 
atacaggaaa 
tcctgcagag 
ccttgggatg 
tacagtctcc 
atggaatagc 
aaaaaaaaaa 



ggtgatttcc 
gagcatgcct 
gggaaggatt 
ggccttgtgt 
aacgtctgca 
gtgacatcta 
accttgcctc 
gtgagggtcc 
cgtgatgagt 
atcccttcta 
gcccagccac 
tacctttctt 
ataaggcagt 
acctgggcca 
aaaaaaaaaa 



SEQ ID NO. 8 
KLK6 nucleic acid 

mRNA join (2 001. .2185,3084. .3135,3559. .360 6,434 6. .4502, 
8122. .8369, 9791. . 9927,11805. .12483) 



CDS join (3567.. 3606, 4346. .4502,8122. .8369,9791. .9927, 11805 .. 11957 ) 

acacttaaaa aatcttctga cttaaaaaaa aaagtatggt gattggaaaa tgtaaatgtg 
catgcgtgct tggcatcaca tttcattggc caggacttcc ctggatgcta aaggtcctca 
aatgccaggc tggggggctg ggacttggtc ccaagggaga tggggaccca gggcacgtct 
gtgagaggag gggcaaggtc agcacaaggc acaggaaggt ctctctgggg caagggatac 
agagaacaga gggatcctgg tccaggtggg agaggtgcag ctctgagttg gggttgaggg 
tgtgggtaca gagaggaagg gaccccccag agagaggagg cagagggata gggcctggtc 
actgggttgt gcaacatcag acttgctgtc tgtgaagata gcacgtcctg agaagaaggt 
gctgaggtca gtggggacca aatgtgagag ggagcacccg gagagtatac tgaataccga 
agtagtcttc atccctggag tgatgggggg tgcacaatgc aagatgacaa ttagattcaa 
tgcaagacaa agaaaagggt tggctgggaa cagtggctca tgcctatggt cccagctcct 
gggaagactg aggcgggagg gtcgcttgag cccaggaggg ttgaggctgc cacgagcaag 
gatcgtgcca ctgcactcca gcctaggcga cagaacaaga ccttgtctca aaagaaaaaa 
gaactttttt ttttaagtta cctgtagtgc ccagcccaag caggtgctga gccagacttc 
attcctatca ttgtccttat tacgcagtga cttccccctc ctcatttctc tccactctgc 
cacgcacaca ccctcaccct ccagcccata ccaaccaccc caaccactgc ctgtggtttc 
ccatgtgcac ccaggccagg cattttcacg gcctttcctc ctgacctacg cctggctcag 
ctttctaggc ccaagttcaa agacacctcc ctaaatcttc ccagatccct ctgctactgc 
ccagcaccac catcttatca cagccccacg tcgttcccaa gtgctctccg atttctgctt 
aactccatgc ctctcgctgt gtgtccgcat ctcatcaata agtcctcaag tcctcttcca 
tcctgctagc ttcctcatcg ctcgggaatc atccccgcta cttcctgggg aaactgactc 
ccttctgggc acacacagtg ctacccccgg ggaaatctaa gaagagaccc aggagaagat 
aagcacggag agtcagagaa tcaaggggaa agaaagggag agaggccggg cacagtggct 
cacacctgta atccagcact ttgggaggcc aaggtgggtg gatcacctga ggtcaggagt 
ttgagaccag cctggccaac atggtgaaac ctcttcccta ctaaaaatac aaaaaacatt 
tagccgggcg tggtggtggg tgcctgtaat cccagctact tgggaagctg aggcaggaga 
actgcttgaa ctcaggaggc ggaggttgca gtgaactgag atcacaccac tgcactccag 
cctgagtgac agagcaagac tccgtcaaaa aaaaagaaag aaagaaagaa aagaaggaag 
gaaagaaaga aggaaggaag gaagggagga agggagagag gaagggagag aggaagggag 
agagagaaaa aaagagggag agagacacaa atacagagac tgagatggga gagagagaga 
gatggaagct ccctcccctc catggccagg gagacagatg gagcaagaga cctcaggggt 
gggcagactt ggaggagaag gaccaggagg atgtggagtg ccgaaatctc cagtcagggc 
caggtgggca gtcagagact gcaaaggagg actgtcagac agggacaaaa ggaagccatt 
gatgtaaccg ccctcccgcc tgcccgccgg aagagaggtt gaggccggag ctgctgggag 
catggcactg gggtgctggg aggcggacaa agcccgattg ttcctgggcc ctttccccat 
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cgcgcctggg cctgctcccc agcccggggc 
tgtagctgtc tccccggctg gctggctcgc 
agcacacaga gggacctacg ggcaggtgtg 
aggctgcttc ccagtgccgg agggctctag 
5 gtggcacaga gagtgctggg ggtgcaggga 
gccggaattt gtcttcagac actttctttc 
gactagaact tcctctgcct cagattcagg 
tttaggtcct agcccctcct ccctcagacc 
gactcaggag tccaggcccc cagcccctcc 

10 ctcctccctc agacccagga gtccaggccc 
caggccccca gcccctcctc cctcagactc 
' tcagacccag gagtccaggc cctcactgca 
cctggtcagg ggtcaccaag agcagagcgt 
gggtaaggag gaaaagggtg tagccagtct 

15 aaaaggacgt tccagaagca tctggggaca 
ctgggggtgt gtgtctggca gtccctgcag 
cttggctctg ccactgcatc tgagtgtctt 
ctttctgcct cctcgtctca aagctgttcc 
ggaggcctgc agcaggtgag atcacagaca 

20 tggccattgc gcacagagcc aggctccgag 
ggccccctat ggtaaccctc tcctgtcgac 
aaagatctaa tcagaatctc accatgggtt 
ttggcactgt tttcgtggat cctcttggaa 
gctagatggg aaggacagag gtcaaggcag 

25 ggatgcagag ggagcagaca gagggatggg 
tgtggcttcc cctctcagga gcggccatga 
ctgcaggtgg ggaaagggca tttggatggg 
atggagaaga ggctggtatt ggggatgggg 
gaaatgagga agacgttggg gattaggcta 

30 gggaggtggg tttgaagata tgagggtttg 
aaacatagaa gaggtaggag gtaggttgga 
cagttgggtt tgtaatggga atggggtaag 
ttttttgaga cagggtctca ctctgtcacc 
ttcactgcag acttgaactc ttgggtctca 

35 gctgggacta caggcgtatg ccaccatacc 
gtgtgtgtgt gtgtgtgtgg agatgaggtc 
cctgggctca agcgatcctc ctgcctcagc 
accaatcttg actggagttc atgttgaggg 
ctgactcaga tcttctctcc ctcagcctgg 

40 ggaccctgcg acaagacatc tcacccctac 
ctctgtggtg gggtccttat ccatccactg 
ccgtgagtct acactgtaaa tgaacagcag 
atgtcaggca ggaggtgaca taggcatccc 
caggtgcatt cggctgttgc ttaattgagt 

45 gcagtggaaa agaaaataaa aaaaagaaaa 
ctgaacttac tttctaatgg gggaattgga 
atttggattt aattctgagc acagtaggaa 
tttgtttttt gagacacagt ctcgctctgt 
cagctcactg caacctctgc ctcccaggtt 

50 tagctgagat tacaggtgtg caccaccttg 
cggggtttca ccatgttggc caggctggtc 
cctcgccctc ccaaagagct gggattacag 
tacattttta caagcaccct ggctaccacg 
ggaggcccac gtgggggctg ttgctttcat 

55 ggcggtcgca gtggggatgg agggatgttg 
agccagcaga atctggcaac gaggaacagg 
tgtatttgtc ctgaacaact gggtgttttg 
aaagagaaac aggccgggtg taggcagggg 
agttggagat gccggggaga tgtcccagca 



aggggcgggg gccagtgtgg tgacacacgc 
tctctcctgg ggacacagag gtcggcaggc 
tgagtcaccc caaccgcact gaacctgggc 
agcccggagt gagggcctgc aggtccctgg 
ggcctggggc accatctgct tgccccagag 
tccaaaaccc ggaggtctaa ggactgagcc 
ccccagcccc tcctccctca gacccaggag 
caggagtcca agttcccacc tcctccctca 
tccctcagac ccaggagtcc aagttctcac 
caagcccctc ctccctcaga cgcaagggtc 
aggagtccag gcccccaagc ccctcctccc 
ctcagggacc agtgctccct tccctggagg 
gggggcggga ggaatgtgtg tgggaggcct 
cctggctcag ggacctgaga gacaggggtt 
gaaccagcct cttccaggga ggcctgggag 
ccctgggctc tgcggcccct gcgtcctccg 
ctctcctcac ggctccccgc atttctaact 
ttcccccgac tcaagaatcc ccggaggccc 
tcacagaacc tgccgggtgg gcggggtggg 
gaaaactccc atacagagga agaacgctag 
aggaaggcaa atcagtgccc aagaaagtag 
actggaccag tggacgtagt ^tgaattctct 
gatgtgggct gaggaagaat aaatcaggag 
gagaccatag caggccagga aggaaggaga 
gggagggtcg aggcagtgac taatggacca 
agaagctgat ggtggtgctg agtctgattg 
ggaggcttgc agacagggtt gggcttgttg 
atatgcacag ggttggggtg ggggagcttt 
agggtgggga atacagatag ggagggtggt 
gggtggggtt ggctttaggg atggggatct 
aagttggaga gagcccggga ataggggata 
tttgggagtg gaaatacaga gaagcttttt 
caggctggag tgtagtggca tgatccatag 
agtgaccctc ccacctcagc ctcccaagta 
ctgctaattt gtgtgtgtgt gtgtgtgtgt 
tcactgtgtt accgaggctg gtctcaaact 
tgggattaca ggcataagcc actgcacctg 
ggatgcgctt ggtttctcca gaactcctct 
gcagaggagc agaataagtt ggtgcatggc 
caagctgccc tctacacctc gggccacttg 
tgggtcctca cagctgccca ctgcaaaaaa 
atgcgactga accctgaggg tgtcttatag 
ccccatccca gcacgaggcc atctgatagc 
acttaatgtg tgccaggccc tgcgggcata 
caaaaaaaaa caagcaaaat tgctgttttc 
tcatttgggg acctgcaggg cgtgatgggc 
gccactgggc agttttgttt ttgttgtttg 
cacccaggct ggagtgtagt ggcatgatct 
ccagcgattc tcctgcctca gcaccccaag 
cctggctaat ttttgtatgt ttggtagaga 
tcgaactcct gacctcaggt gatccgcccg 
gcatgagcca ccaccacacc cagcctgatt 
tggaacgtgg tctgggcaag agagagggag 
ccggcgacat aggagggtgg cttgaaccca 
aatatcttgg gatgtggaat tctgagactg 
agggagagga agaagcacgg ctggcttccg 
ccacgtcttt ctctgagttg tgggagaggg 
agcatctgac attttgcttt agccacgatg 
gggaggccag ggaggactct ggagctcaga 
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ggagaggtca 
ccatgggact 
ggacaaaaac 
gatgctaact 
accacaaggg 
aagtgacaat 
ccatcattca 
ttgggcagtg 
caaggacaga 
atcccagcac 
aacatggctc 
cttctgtaat 
ggaggttgca 
tctgtctaaa 
catgcctgta 
tacaaagctg 
gagcgagacc 
tataattaca 
ggaaggactt 
tcagggcaga 
agaaaataaa 
tgactacagc 
ttaggacaat 
atatgtatca 
tgaatagaac 
atcagccaca 
ttcaagtact 
gatcgagatc 
aacacaatga 
ctctggtgga 
ttttaattgg 
gtgagctatg 
ccctgaggtg 
ggagcagagg 
tgcttgcaag 
agtcatagga 
tggttgcttt 
gaggctactg 
gtgggaaagg 
ggacagcttg 
tgaaaggatg 
attcatgact 
cccagccctc 
aagggagagt 
tgccgccagc 
tgaactcatc 
catcctgggc 
tactggctac 
tgggaagatg 
gggggaggga 
gaccaatggg 
aaggtcagtg 
attcattgaa 
aaagcacaaa 
gaagagggac 
tccaatagat 
tgaaccattg 
tagaaaagga 
cagtggggga 



gggctggagg 
agatgagatc 
cctgggcgct 
accaatcagg 
actcttggca 
ccctggggtg 
ttcctttgat 
gagattcagc 
aaggtgcaga 
tttgggaggc 
aaccctgtct 
tccagcaact 
gtgagccgag 
aaaaaaaaaa 
atcccagcac 
cagtgagctg 
ctgtctcaaa 
aattatgaag 
tctaatgaga 
ggaaaggctg 
atttcccgtt 
cagatcacac 
ggggagccat 
aacacctatc 
aaaaatcccc 
tatagcaaat 
cagcagccac 
atggcatcgt 
gtaaatattt 
acagaaagca 
gcaactaagg 
cagatacttg 
ggaagatcac 
gagagaaggg 
gcctggtgtg 
ggggctgagc 
gtggaggatg 
ctctagttca 
tgagatgtgg 
gatgtagggc 
gaattgccat 
tcccagccct 
ttccttccca 
tcccaggagc 
catgaccagg 
cagccccttc 
tggggcaaga 
ttggggaagt 
ggctaatggt 
gagtgaattt 
tgaatagcat 
gggagatgct 
cagcaggaag 
agccaactga 
taaggggaaa 
cagcaggatc 
gatgaagggc 
ccaatgaggg 
tggtgaggcc 



ttaaaatgaa 
atccaaaaag 
gatcctcact 
tgctgagtga 
ccattttgca 
gttcccctga 
gtacattgac 
aatggatggg 
caagcaaagt 
cgaggtgggt 
ctactgaaaa 
tgggaggcta 
atcgcgccac 
ggaaagaaag 
tttgggaggc 
tgatctacag 
aaaacaaaca 
gaaaagaata 
taaaatccaa 
tgataacacc 
cactgggggg 
aggggctcca 
gggtgatgtc 
ctgtgccagg 
atcttgacat 
tacattgaaa 
ctgtagcttg 
agcatttagt 
aacaataaat 
ggggagggag 
aaggcttccc 
gaggacagac 
tattgtgttc 
gagagtggga 
ccacgttgag 
agaggaggca 
gactgtgggg 
ggtaggaagt 
ccagattctg 
atgaaataaa 
ttacccagct 
ctctgaagcc 
ggaatcttca 
agagttctgt 
acatcatgct 
ccctggagag 
cagcagatgg 
gtgccaaagg 
gaggaccaat 
gggagctggg 
gggagagatg 
aatcaggaag 
gaataatgga 
aggatgtgaa 
ggatcaatgg 
catgaaggtg 
cagtgggaag 
aggtggacca 
agttagaaaa 



ggcatcgtca 
ctggcatagt 
agtcagattc 
aaccatgtac 
aatgaggaaa 
ccccaaggag 
taagagcacc 
acacacacgt 
gagggctggg 
ggattacctg 
tacaaaaaat 
aggcaggaga 
tgcactccag 
aagcagcaaa 
cgaggcgggt 
aacaccactg 
aacaaaagaa 
cgggtaccta 
gcggaggcat 
cctgaggtga 
cagaaggtgc 
gtgccaagtg 
tgagcaaggg 
tgctgatcaa 
cctagagctg 
ttaatgaaat 
tggttccccc 
ggacagcatt 
atatagcagg 
ataggaattg 
tgagaggcga 
ttgctggcag 
aaggcaagac 
ggagaagatg 
aactttggct 
caggaccaac 
gaccagagac 
gaaaaggcag 
gatatgcttc 
gagagtgaag 
ggggaagact 
tcaactgcag 
ggtcttcctg 
tgtccgggct 
gttgcgcctg 
ggactgctca 
tcagtagtgg 
atggggagtg 
gggacagggt 
ccagtgagtg 
gaacataaga 
gatgtcaaag 
gaaggaactg 
ttgagacagt 
tccagaggag 
ggcctgtgtg 
gcagagacaa 
ttggatgaag 
ggaccaagga 



gcaaacaggt 
tggaggagct 
acgacagctg 
acacctttcc 
ctgaggtgca 
accttggatg 
tgctaagtgc 
catccctgcc 
catggtggct 
agttcgagac 
tagccaggcg 
attgcttgaa 
cctgaaccac 
ttgggctggc 
ggatcactcg 
cagatccagc 
gcaaaccctt 
ctttagatgg 
gaagatggga 
gaaccgtctt 
tgggagataa 
gaggagccca 
agtgactctc 
cgcactggag 
cactgtctaa 
ggaaaatcca 
agccacctct 
gctctgcaag 
tcggatgatt 
cctactaaca 
catttaaagg 
agggaacagc 
agggaagcca 
tctgtgagat 
ttgattctga 
ttacattgtt 
agagcaggga 
ctcaaaccaa 
agagaggcaa 
aatagccccc 
gtgggaggag 
cccaagggct 
gggaagcata 
gtgatccacc 
gcacgcccag 
gccaacacca 
gaggctggtg 
ggaaaattgg 
ttcaatggga 
aacagccaat 
tgaaggttca 
gtcaaagggg 
atggaagaag 
gaatgggggt 
tcactagagg 
tgaagggcca 
tgggggagga 
ggctaatagg 
gggaagcaga 



gtatttaaag 
ggagggccca 
ccacttgttt 
tggaatgccc 
gggaaatagc 
actctcacca 
cacattcgag 
ctcgggagca 
cacgcctgta 
cagcttggcc 
tggtggtggg 
cgtgggaggc 
agagcgagac 
cgtggtggct 
agcccaggag 
ctgggtgaca 
caaaacccca 
aggagggtca 
aaaggaatgt 
gagtattctc 
ggttggaaag 
ggctttattc 
tgtt'tcagga 
atactatatc 
tatggtagcc 
caagccacat 
ggacagtgca 
gaggagaaat 
gtgataggtt 
ggtatttgta 
aagtgaggga 
agtgcaaagg 
gcgtttggct 
gatggggcag 
gtgagatggg 
aaaatatctc 
gcccagtgag 
gatggtagcc 
aaggaattct 
aagattattc 
caggccagcg 
ccaggtgaga 
accttcggca 
ctgactatga 
ccaaactctc 
ccagctgcca 
gggagcaggc 
tgaggggcca 
gaaaggtcaa 
ggaaaatgta 
ataaagaggg 
actgatcagg 
agaaaccaat 
atagctgatg 
aaaaaacagg 
ataagaaagg 
tgcggcaagt 
aagggagagc 
ccaataggaa 
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gagagagcca atgagggagg gcagggccag 
attggaggaa gggccaatag aaagggagga 
aggaccaatg atggaggtgg accattggat 
gagagggcat ggccagttag gaaaagacca 
5 caatgggcag gaagtgtcca atgaagaatg 
agggcgtaac agaggaagag tcctccaggt 
gtggaagaga gaaaagtgga ggagggacct 
gactcctgga gaagagacta ttaatgagga 
aagagggacc aattaggagg cagggacgat 

10 aggaagaggg gtgccaatag aaaagaggga 
ggggggtgac tggggaggat gaggggagtg 
cccctaacag gtgatttccc tgacaccatc 
gaggagtgtg agcatgccta ccctggccag 
gagaagtacg ggaaggattc ctgccaggtg 

15 agggacagga cgaagtcaca aaaacatggc 
aagagagctt tacagagaca gatagagaca 
agagacttag ttcaacacac agagacacag 
ccagcagaga caggaagtgc agagacaagg 
gccaggagca gcggctcatg cctgtaatcc 

20 cacctagggt caggagttcg agaccagcct 
aaaatacaaa aattaggatg ggcacagtgg 
gccgaagcag gaggatcacc tggggtcagg 
aaccctatct ctactaaaaa tacaaaaatt 
cccagcacct tgggaggccg aagcaggagg 

25 ctggccgata tggtgaaacc ctatctctac 
caggcgcctg tagtcccagc tactcaggag 
ggcggaggtt gttgcagtga gtcgagatca 
aagattccgt ctcaaaaaaa aaccaaaaaa 
ctgtagtccc agctactcgg gaggctgagg 

30 ( ggctgcagtg agctgagatc acgccactgc 
tctaaaaaca aaaagaacca aagagaagta 
ccttcctcaa acagagcccc cacgagtcct 
agacactagc tggggaaagg ggactccctc 
gtcatccatc caggctctcc tctttatgcc 

35 agaccaacca agggggagac acaggcagaa 
acagggaaag cgatacatag caagttggac 
ctcaagacac gaggtggaga ggtgtccctg 
gggccctccc tacctctccc acctgggtct 
ctcctcttcc tcctcctcct cctccctcat 

40 gtctctacac ctctgcctct ctccacacct 
tcttgctctc tatgttcctc tgcatcttgg 
ttattctctc tctaccattc tctctctgtg 
tctctctgtc cctgagtctt tctctccatc 
tctctctctg tcacacacac acacacacac 

45 tctgggtttc tatctgtatc tgactttctc 
cgctggtatg tggagaccac ctccgaggcc 
caaaggagaa gccaggagtc tacaccaacg 
ccattcaggc caagtgaccc tgacatgtga 
ctggttccag aacgtctctc acctagacct 

50 ccctgatgct taataaacgc agcgacgtga 
ctccatcctt gcatcactgg ggaggacgtg 
ccccaccact aagagaatac aggaaaatcc 
cacacgtttg atttcttcct gcagaggccc 
cttactgtcg gtgtcccctt gggatgtacc 

55 atgaagataa ggatgataca gtctccataa 
cacacctatg acatacatgg aatagcacct 
ttt 



ttaggaaagg accaatgagg aaggtagacc 
tccatgaggg agggtgggga cagttagaaa 
gaagaaccaa tagaaaggaa gaaccaatgg 
atggtcacag agtgaccaat caagatgaat 
gactactgat caggaggggt acagtagagg 
caactgaaac tactgaagaa ggtgggacca 
aagagaaaag gaaaaccaat aggaaatgag 
agacagccaa tgggggggaa gaatgataga 
ggtaatgaga tgtaagaatg agagacaaac 
ccaatagagg atggaggact tataggggtt 
caaggcctgg gctgagtctg gcccatctct 
cagtgtgcat acatccacct ggtgtcccgt 
atcacccaga acatgttgtg tgctggggat 
aggtgacccg gatctgccac ttacacagcc 
cagacacagg aagagagaga cacaggccaa 
ggctgaggga gaacccaagc cttgaaaaga 
tcagggatat gcagagatat aaagacacag 
atggaggccg cgggatcaag aaccagagag 
cggcactttg ggaggccgaa gcaggaggat 
gatcaacatg gtgaaaccct atctctacta 
ctcatgcctg taatcccagc accttgggag 
agttcgagac cagcctgatc aacatggtga 
aggatgggca cagtggctca tgcctgtaat 
atcacctggg gtcaagagat tgagaccagc 
taaaaataca aaaattagct gggcctggtg 
gctgtggcag gagaatcact tgaacctgga 
tgctactgca ctccagcctg gcaacagagc 
caaaaattac gcaagcatgg tgggacacac 
ctggagaatt gcttaaaccc aggaggcaga 
actccagcct ggggacagag ccagactctg 
gtaaggaagc agatggtgtg aggggactgt 
gctcagaaac gaccaggctc tggaggaggg 
ccgaatactt taacttgggt ttcctccatt 
agaatgacta atgcactgag ggatgtgcag 
aeggagacac aggcagaaac agggacagag 
gcaaagaaag ggcaggtggg cgagactgtc 
gacagaatag tgccaggcat atctctccct 
tatcgtctcc tcctccccct cctccctctc 
catcttcttc ttttctctct ctctccatcg 
ctcagtctcc attcttaaat tgtttctctt 
cattcctatc tctgtgtctt tgagtctcct 
cctttgtgtg tcttactgtc tctctctctg 
tttcagtaag tacctctgtc cctttctacc 
acacacacac acacacacac acacacagtc 
cctctttcct gcagggtgat tctgggggtc 
ttgtgtcatg gggtaacatc ccctgtggat 
tctgcagata cacgaactgg atccaaaaaa 
catctacctc ccgacctacc accccactgg 
tgcctcccct cctctcctgc ccagctctga 
gggtcctgat tctccctggt tttaccccag 
atgagtgagg acttgggtcc tcggtcttac 
cttctaggca tctcctctcc ccaacccttc 
agccacgtgt ctggaatccc agctccgctg 
tttcttcact gcagatttct cacctgtaag 
ggcagtggct gttggaaaga tttaaggttt 
gggccaccat gcactcaata aagaatgaat 
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SEQ ID NO. 9 

KLK6 nucleic acid 



5 CDS 246. . 980 

aggcggacaa agcccgattg ttcctgggcc ctttccccat cgcgcctggg cctgctcccc 

agcccggggc aggggcgggg gccagtgtgg tgacacacgc tgtagctgtc tccccggctg 

gctggctcgc tctctcctgg ggacacagag gtcggcaggc agcacacaga gggacctacg 

10 ggcagctgtt ccttcccccg actcaagaat ccccggaggc ccggaggcct gcagcaggag 

cggccatgaa gaagctgatg gtggtgctga gtctgattgc tgcagcctgg gcagaggagc 

agaataagtt ggtgcatggc ggaccctgcg acaagacatc tcacccctac caagctgccc 

tctacacctc gggccacttg ctctgtggtg gggtccttat ccatccactg tgggtcctca 

cagctgccca ctgcaaaaaa ccgaatcttc aggtcttcct ggggaagcat aaccttcggc 

15 aaagggagag ttcccaggag cagagttctg ttgtccgggc tgtgatccac cctgactatg 

atgccgccag ccatgaccag gacatcatgc tgttgcgcct ggcacgccca gccaaactct 

ctgaactcat ccagcccctt cccctggaga gggactgctc agccaacacc accagctgcc 

acatcctggg ctggggcaag acagcagatg gtgatttccc tgacaccatc cagtgtgcat 

acatccacct ggtgtcccgt gaggagtgtg agcatgccta ccctggccag atcacccaga 

20 acatgttgtg tgctggggat gagaagtacg ggaaggattc ctgccagggt gattctgggg 

gtccgctggt . atgtggagac cacctccgag gccttgtgtc atggggtaac atcccctgtg 

gatcaaagga gaagccagga gtctacacca acgtctgcag atacacgaac tggatccaaa 

aaaccattca ggccaagtga ccctgacatg tgacatctac ctcccgacct accaccccac 

tggctggttc cagaacgtct ctcacctaga ccttgcctcc cctcctctcc tgcccagctc 

25 tgaccctgat gcttaataaa cgcagcgacg tgagggtcct gattctccct ggttttaccc 

cagctccatc cttgcatcac tggggaggac gtgatgagtg aggacttggg tcctcggtct 

tacccccacc actaagagaa tacaggaaaa tcccttctag gcatctcctc tccccaaccc 

ttccacacgt ttgatttctt cctgcagagg cccagccacg tgtctggaat cccagctccg 

ctgcttactg tcggtgtccc cttgggatgt acctttcttc actgcagatt tctcacctgt 

30 aagatgaaga taaggatgat acagtctcca tcaggcagtg gctgttggaa agatttaaga 

tttcacacct atgacataca tgggatagca cctgggccgc catgcactca ataaagaatg 
tatttt 



SEQ ID NO. 10 

hk7 amino acid 

MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQ 
40 VALLSGNQLHCGGVLVNERWVLTAAHCKMNEYTVHLGSDTLGDRRAQRIKASKSFRHP 
GYSTQTHVNDLMLVKLNSQARLSSMVKKVRLPSRCEPPGTTCTVSGWGTTTSPDVTFP 
SDLMCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGL 
VSWGTFPCGQPNDPGVYTQVCKFTKWINDTMKKHR 



SEQ ID NO. 11 
KLK7 nucleic acid 

CDS 16. .777 



50 ggatttccgg gctccatggc 

ttagccttgg aaactgcagg 

tgtgcaagag gctcccaccc 

ggaggcgtcc tggtcaatga 

tacaccgtgc acctgggcag 

55 tcgaagtcat tccgccaccc 



aagatccctt ctcctgcccc 
agaagaagcc cagggtgaca 
atggcaggtg gccctgctca 
gcgctgggtg ctcactgccg 
tgatacgctg ggcgacagga 
cggctactcc acacagaccc 



tgcagatcct actgctatcc 
agattattga tggcgcccca 
gtggcaatca gctccactgc 
cccactgcaa gatgaatgag 
gagctcagag gatcaaggcc 
atgttaatga cctcatgctc 



WO 2004/075713 



35/47 



PCT/CA2004/000281 



10 



gtgaagctca 
cgctgcgaac 
gatgtgacct 
tgcacgaagg 
tccaagaaaa 
caaggtctgg 
actcaagtgt 
cacactgagt 
tgacctatga 
ctgttgataa 
aaccctcaa 



atagccaggc 
cccctggaac 
ttccctctga 
tttacaagga 
acgcctgcaa 
tgtcctgggg 
gcaagttcac 
taattaactg 
agtcaaattt 
accaatcaaa 



caggctgtca 
cacctgtact 
cctcatgtgc 
cttactggaa 
tggtgactca 
aactttccct 
caagtggata 
tgtgcttcca 
gactttacct 
ttggtaaaga 



tccatggtga 
gtctccggct 
gtggatgtca 
aattccatgc 
gggggaccgt 
tgcggccaac 
aatgacacca 
acagaaaatg 
ttcctcaaag 
cctaaaacca 



agaaagtcag 
ggggcactac 
agctcatctc 
tgtgcgctgg 
tggtgtgcag 
ccaatgaccc 
tgaaaaagca 
cacaggagtg 
atatatttaa 
aaacaaataa 



gctgccctcc 
cacgagccca 
cccccaggac 
catccccgac 
aggtaccctg 
aggagtctac 
tcgctaacgc 
aggacgccga 
acctcatgcc 
agaaacacaa 



15 



20 



25 



SEQ ID NO. 12 
KLK7 nucleic acid 



mRNA 

join (175 6. .1785,317 9. .330 9,3722. .3869,4566. .4813,5129. . 5265, 7362 .. 8265) 
/product=" stratum corneum chymotryptic enzyme" /note=" alternatively 
spliced" 

mRNA join (1756. .178 5,317 9. .3309,3722. . 38 69, 4566 .. 4 813 , 
5129. .5265,7362. .7991) /note="alternatively spliced" 

mRNA 

join (1821. .1864,3179. .3309,3722. .3869,4566. .4813,5129. .5265,7362. .8265) 
/product=" stratum corneum chymotryptic enzyme" /note~"alternatively 
spliced" 



30 



35 



40 



45 



50 



55 



mRNA 

join (1821. .18 64,3179. .3309,3722. 
/note="alternatively spliced" 



.38 69,456 6. .4813,512 9. .5265,7362.-7991) 



CDS join (3237. .3309,3722. .38 69,45 66. .4 813,5129. .52 65, 7 3 62. .7517) 



ggcatggtgg 
gaacacgggg 
tgacagagcc 
gtgtccatgc 
tttctgttcc 
aaggacatga 
attttcttaa 
gtcaatagtt 
cctttgggta 
agaaaaaaat 
caaaccccca 
ccgaactgaa 
cctcaaaaat 
atacaaagaa 
tggcaaatat 
tagagttggg 
tcccccaggc 
gcaagaaaat 

cggggccgag 

gaaacccggt 
tcccaggtac 
agtgagccga 
aaaaagaaag 



tgcacgcctg 
gagtggaggt 
agagtccatc 
agtctcatca 
tgctttagtt 
tcgtgttctt 
tccagtctat 
ctacaatgaa 
cacacactgg 
aactaatggg 
tggcgcacgt 
aagttaaaaa 
gttctattag 
atctatgtat 
tagtggggtg 
aatgggttca 
gggccagaat 
ggccccagaa 
gcgggcacat 
ctctactaaa 
tcgggaggct 
gatcgcgcca 
gaagggaggg 



taatccagct 
tgcagtgagc 
aaaaaaaaaa 
tttagctacc 
tgtaagggta 
ttttatggct 
cactgatgga 
cgtacgtgtc 
ggcttatgag 
tactaggctt 
tcacctacgc 
aagaaaaata 
ataatgttaa 
aacatgatta 
gggttaggga 
ggcgtaatta 
taggatgtag 
gccgggcgca 
catgaggtca 
aatacaaaaa 
gaggcaggag 
ccgcactcca 
agggaggagg 



actcaggact 
cgagatcgtg 
aaaaataaga 
acttgtaagt 
atggcctcca 
acgtagtatt 
catttaggtt 
catgtgtctt 
agggtggaga 
aatacctggg 
aacaaacctg 
aatatttgct 
gtacagacat 
aaatgaccat 
aagcgttgtt 
caggcacgac 
ggtctaggcc 
gtggctcacg 
ggagatcgag 
ttagctggga 
aatcacttga 
gcctggcgat 
gaagaaagaa 



ctgaggcagg 
ccattgcact 
aagattcttc 
aggaacatgc 
gctccattca 
caattgtgta 
gattccctgt 
taaacagaat 
gtgggaggaa 
tgattaaata 
cacatcctgc 
tataaattaa 
ttttgttata 
aagaacatag 
tttaacttac 
tgggatcagc 
acccctgaga 
cctgtaatcc 
accattctgg 
gtggtggtgc 
acctgggagg 
agagagagac 
agaaaaccgc 



agaatcactt 
ccagcctggg 
tctcctctat 
catatctggt 
cgtccctaca 
tacgtaccac 
gtttgctgtt 
gatttatatt 
ggagaggatc 
atctgtataa 
acatgtaccc 
taaatgaagc 
aatacataat 
atcctaaaca 
acctctctgt 
ttggacaagt 
gggggtgagg 
cagcactttg 
ccaacatagt 
gtgcctgtaa 
cggagctggc 
tccatccaaa 
cccagagaag 
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gacccgagcc agagcctatt ctctgagctc agcgactgct tgaatcccgc tcctgcccct 

cagacccagc gcaccgggtc cctcccccga gagcagccag gagggactgt gggaccagaa 

- tgtgcggggg cgcaggagct gggcaccgcc cgtccttcgg agggagggtg gagagagagt 

gcagtggtgc caattgctct cgctgcgtca gggttccaga" taaccagaac cgcaaatgca 

5 ggcgggggtg tcccagagtc ggctccgcct gcaccccagg gcgctggggc cgggcatggg 

gcggggggtg atataagagg acggcccagc agagggatga agattttgga gcccagctgt 

gtgccagccc aagtcggaac ttggatcaca tcagatcctc tcgaggtgag aagaggcttc 

atcaagggtg cacctgtagg ggagagggtg atgctggctc caagcctgac tctgctctcg 

agaggtaggg gctgcagcct agactcccgg tcctgagcag tgagggcctg gaagtctgca 

10 atttggggcc ttttagggaa aaacgaacta cagagtcaga agtttgggtt ccacagggaa 

gggcaagatc ggagcctaga ttcctgggtc tctagggatc tgaagaacag gaattttggg 

tctgagggag gaggggctgg ggttctggac tcctgggtct gagggaggag ggcctggggg 

cctggactcc tgggtctgag ggaggagggg ctgggggtct cgactcctgg gtctgaggga 

ggaggggctg ggggcctgga ctcctgggtc tgagggagga ggggctggga cctggactcc 

15 taggtctgag ggaggaggag ctggggcctg gactcctggg tctgagggag gaggggctgg 

ggcctggact cctgggtctg agggaggagg ggctggggcc tggactcctg ggtctgaggg 

aggaggggct ggggcctgga ctcctgggtc tgagggagga ggggctgggg cctggactcc 

tgggtctgag ggaggagggg ctggggcctg gactcctggg cctgagggag gagggactga 

gacctggact cctaggtctg agggaggagg gactgggacc tggactcctg ggtctgaggg 

20 aggaggagct gggggcctgg actcctgggt ctgagggagg aggggctggg gcctggactc 

ctgggtctga gggaggaggg gttggggcct ggactcctga gcctgaggga ggagggactt 

ggacctggac tcctaggtct gagggaggag gagctggggg cctggactcc taggtctgag 

ggaggcgggg ctgggggcct ggactcctgg gtctgaggga ggaggggttg gggcctggac 

tcctgagcct gagggaggag ggacttggac ctggactcct aggtctgagg gaggaggagc 

25 tgggggcctg gactcctagg tctgagggag gaggggctgg gggcctggac tcctgggtct 

gagggaggaa ggtgctaggg tctggactct tgggtatgag ggaggaggag gttaggggtc 

tggacttctg agtgtaagga aggagaggcc agagaaagga atttctgggt ctgagggagg 

aggggctggg gttctggacc cctaggtctg agggaggagg ggctggggcc tggacccctg 

ggtctgaggg aggaggggct ggggccggta ctcctgggtc tgtgggggga ggggctgggg 

30 cctggacccc tgggtctgag tggggagggg ctgggcctga atgctttctc cttctcagct 

ccagcaggag aggcccttcc tcgcctggca gcccctgagc ggctcagcag ggcaccatgg 

caagatccct tctcctgccc ctgcagatct tactgctatc cttagccttg gaaactgcag 

gagaagaagg tgaaagctgg actgggaagt ctgacctcac ctcagggccc ccactgaccc 

tctccaagga gtccctgagt cagaaccctt ccctcctcaa acagcttcca tcctgggagg 

35 accagactgt cggctgaagc ccccgctctt cctgcttctg ctgactcagg gggtctctgt 

cccctccagg ccctgcctcc tgtgctcagg gtctctctgt ggttccccag atgagatgcg 

cctcctgggt ttctgagtgg gctccttctg tctgtctcta tccctatctc ttgctttctc 

tgtatttctc cacacatttt catctgtctc tgtccatctc tgactctggg aatccctgag 

gtgcagcctc agccttcccc taatgctagc tacccacgtg ctcctccatg tctccatcca 

40 gcccagggtg acaagattat tgatggcgcc ccatgtgcaa gaggctccca cccatggcag 

gtggccctgc tcagtggcaa tcagctccac tgcggaggcg tcctggtcaa tgagcgctgg 

gtgctcactg ccgcccactg caagatgaag taggtgccgc ccaagtctct gctggaggtg 

caccagcgtc tccagctcgc tatgggggtg gaagggcagt ctttctgtgc ctacggctct 

attctcctct ctctgggtct ctgtcctcct ctctctgggc ctctgtaccc cctctccctg 

45 gggctctgtc cccctctctc cctggctctc tgtctccctc tctctgggtc tctgtccccc 

tctctctgga tctctgttcc cctctctctg tgtctctgtt ccccattctc tctaggtctc 

tgttccccct. cctctctctc tgggtctctg tccctctctc tctggatctc tgtccccctc 

tctctctggg tctctgttcc cctctctctg ggtctctgtc ccctctcctc tctctgtgtc 

tctctccccc ctcctctctc tgtgtctctg tcccccctcc tatctctgtg tctctctccc 

50 ccctcctctc tctgggtctc tgtccccccc tctctgggtc tctgtctccc tctctctggg 

gctctgtccc cctctctctc tggatctctg ttcccctctc tctgggtctc tgtctcccct 

cctctctctg tgtctctgtc ccccctcctc tctctgggtc tctgtcccca ccccgtcccc 

caggtctttg cacaccctct ctgtcacagt gtctcttctg aatctgtgaa tgtcactcct 

cgcagtgagt acaccgtgca cctgggcagt gatacgctgg gcgacaggag agctcagagg 

55 atcaaggcct cgaagtcatt ccgccacccc ggctactcca cacagaccca tgttaatgac 

ctcatgctcg tgaagctcaa tagccaggcc aggctgtcat ccatggtgaa gaaagtcagg 

ctgccctccc gctgcgaacc ccctggaacc acctgtactg tctccggctg gggcactacc 

acgagcccag atggtaggtg gcctcagtga cccaggagtg caggccccag ccctcctccc 

tcagacccag gagtccaggc ccccagcccc tcctccctca gacccaggag tccaggcctc 
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agcccctcct ccctcagacc caggagtcca ggcccccagc ccctcctccc tcagacccgg 

gagtccagac cccagcccct cctccctcag acccagcagt cctgggcccc agaccctcct 

ccctcggaac caggagcctg aacaacagcc cttctggtcc tcgcccccat cctctctgac 

tgacagctct ccctgctcct ccctgcagtg acctttccct ctgacctcat gtgcgtggat 

5 gtcaagctca tctcccccca ggactgcacg aaggtttaca aggacttact ggaaaattcc 

atgctgtgcg ctggcatccc cgactccaag aaaaacgcct gcaatgtgag accctccccc 

ccaattcctc cccagtcctg ggtaccctgt ctgcatgccc cagggacaga gcttgacccg 

agtgactggg taccaagccc ggccttgccc tccccccagg cctggcctcc tcagcttttt 

ccacctcatt ctctgcctag gtcaggggtg ggagtttact taggggccaa tgtggccctg 

10 gggatgagac agagagttta ataggggtga gaaagtgggg gtgggaccag ggaaggagac 

tgaggtgctg gcctcaggcc caaaccttaa gggggcacca aaaacctcag tgattgagat 

aaatcataat gcaatattta aaaataaaaa taaaaactca tgcagaagtc catgatggac 

aaaatgtcac attttaaata aagagcaggt ggatcttact gaattttccc ttgccgtaag 

tactagcgtg gctcagcaca gcgctgtact ggcactgtct tcatttaaaa tgtggatacc 

15 atgcccatca tgcagtttta tgtattacat ttgatttcgt taagtactgc attgaagtat 

tgtgtattgc agttactgag attttgtgcc tgaagctgat gactcactca cctgaccctg 

gccctggtcc cggggaaaac actctttctc tccacctcct ctctgttccc tctttctggc 

cttttgtcat cccctctgtt tctgaacagt cttcccacat ctctctttgt gacataattt 

catttcattc ttttcctctt tgttttttct ctgtgttgag ctagcttgct ctccctccct 

20 tgttctctct ccatgccctc ctctctgctc tctgtcttct ccctctttct cttgcttctc 

tctctctcct cccctccctc tctcctctcc ctgcccccct gctctctctt ttttcctctc 

tctctgtctc ctctctggcc ctctcctctt tctctctctc ccccacttct ctgtctctct 

tcatctctct ccctcatctc tccttgcccc ctccttttta ctgtctctct ctttctcttt 

cttctatctc tctcctctcc ctgccgctcc cccatctctg tctttctttc tctctcttta 

25 ' ttctcctcct ctcttccagt ctctctctcc tctccccacc cccaccccat ctctctcccc 

acaccttccc cccctttctc tttgtctctc tcttctccct ctttcttctc cacccccatc 

tctctctctc ttctcttccc acaccctccc catctccctc atctctttgt ctgtctctct 

tctccctcct tcttttccac ccccatctct ctgtctctct ctctccccat accctttccc 

tcttcctcat ctctctttgt ctctctctcc tttccctctt tcttctccac ctccaactct 

30 ctctgtctct ccacacccat cctccttgct cacatctgca ccttcagctg tcaggggatg 

tgggatggtg agtgttaggg atagaggaga tgggagagag atgactgtcc tagagaatag 

ggtgttcccc ttctcccctg gtgagggcca gtttcatgaa tgtgcaagct ctgcacggac 

acagagcccc acactcagaa gggtctcaaa cttagtctaa tgcattcctg ctgttgtctt 

gaaattctca ataatttttg aacaaagggc cctgcatttt cgttttgcac caagtcctgt 

35 aaattatgta actggtcttc accctggtct ccgagaccat cgtgtccccc tttcctgcgc 

cacagggcac gcatccaccc cttggagatg atgttccttc tcccactagc ttggagcagg 

gtccttaaca ttggaaaata aagagtgctc tgatcctgga agccccaccc cttctctgca 

attggtctca ttggccaagg gtcaaaccag tgtcttcaaa ggacctagtg tgtccctagc 

actagctctc ccattagtcc ccagagacaa tgagtctctt ctcattggct atggtggaag 

40 tccataatct gcaagacaaa gaccgataac tgaggaatgt atgagaatga gttgggcttt 

gatctgaagc caaagttaat ctccggctct attccctcta gggtgactca gggggaccgt 

tggtgtgcag aggtaccctg caaggtctgg tgtcctgggg aactttccct tgcggccaac 

ccaatgaccc aggagtctac actcaagtgt gcaagttcac caagtggata aatgacacca 

tgaaaaagca tcgctaacgc cacactgagt taattaactg tgtgcttcca acagaaaatg 

45 cacaggagtg aggacgccga tgacctatga agtcaaattt gactttacct ttcctcaaag 

atatatttaa accaacctca tgccctgttg ataaaccaat caaattggta aagacctaaa 

accaaaacaa ataaagaaac acaaaaccct cagtgctgga gaagagtcag tgagaccagc 

actctcaaac actggaactg gacgttcgta cagtctttac ggaagacact tggtcaacgt 

acaccgagac ccttattcac cacctttgac ccagtaactc taatcttagg aagaacctac 

50 tgaaacaaaa aaaatccaaa atgtagaaca agacttgaat ttaccatgat attatttatc 

acagaaatga agtgaaacca tcaaacatgt tccaaaagta ccagatggct taaataatag 

tctggcttgg cacaacgatg ttttttttct ttgagacaga gtctctgttg cttgggctgc 

aatgcagtga tgcaatcttg gctcactgca acctccgcct cctgggttca agtgattctc 

gtgcttcagc ctcccaagta cctgggacta caggtgtgca ccaccacacc aggctaattt 

55 tttgtgtatt tttactagag acagggtttc accatgttgg ccagcatggt cttgaacgcc 

tgacctcaga tgatccaccc accttggcct cccaaagtgc tgggattaca ggcatgagcc 

accacggcca gcccacaatg atattacaaa cctattaaaa atgatactta gacagaattg 

tcagtattat tcaagaacat ttaggctata ggatgttaaa tgacaaaagg aaggacaaaa 

atatatatgt atgtgaccct acccataaaa aatgaaatat tcacagaatc agatctgaaa 
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acacatgtcc 
tttccttgaa 
tctaagattg 
ttttgagccc 
tttttcctgt 

ggagggagga 

agcatctcac 
cagtagacag 
ggaagatggt 
gatgagggag 
aagatcctct 
acacgccctc 
tccagatcct 
agccctgtca 
taacacagaa 
agaccccccg 
tgtttctggg 
gttttagcaa 
aaaagtaaca 
ctgctctcct 
gttgattttt 
ggactagctc 
ttttccttta 
aagcacctac 
aaacattgt 



cagactgcat 
tgtgcacttt 
gagagaggtg 
tggtcctgcc 
ctgtaaaatg 
gagaacaggc 
gagtgacaag 
gacacagggg 
ggcacctgct 
gagtcctctg 
gtgtggtcac 
ctacctgttc 
tcccctcctt 
tattgcagaa 
aacgcaggag 
ccccaacccc 
cctgtcaagt 
cattttctct 
aaacattgca 
atacgcaagc 
tttaagtaat 
tggatcttgg 
agctcattac 
tttctagggt 



actggggtcg 
tataacatga 
acctttcagg 
ggccctgttc 
ggagagagag 
caacttcatc 
tgaggaggga 
tcccacgggg 
ccccaagaag 
tgactcagag 
acctcagacg 
ttcctgtttt 
atctcatctc 
attctgcagc 
tccaggcccc 
tcctccctca 
ttaagaatgt 
ctcttctgca 
tttgcactaa 
tacaggtaga 
tcatttttct 
cttcttgggt 
ttcccctgtc 
tattacagag 



tcatgaggtg 
aaaataaagg 
aagggagact 
cagggcatat 
gaaaggatgg 
agcgtgggaa 
ggctggcggt 
gtctgccaga 
ggagggaaag 
cctggccaca 
ctgctgaccg 
tctcccagaa 
cctctgagtc 
cgctaattct 
cagcccctcc 
gacccaggag 
caaacatttt 
aggcactcca 
gtcagcctgg 
ttggtttgca 
ttgggtaagc 
tcaaatccca 
cctgttcctt 
attcaataag 



tctccttcct 
tggggaaaaa 
agaaagaaat 
ttccatttcc 
agagaggaag 

ggggtgtgaa 

tttcagaggg 
agtaagcaaa 
gaacctcggg 
gccccagcca 
aggagccact 
ttccctcccc 
tctcctaacc 
gattctccca 
ttcctcagac 
cccaggtccc 
cgaccagtca 
acattcaatc 
agatccctgg 
atgactgaga 
agtatagtgt 
gttctagtcc 
catccttgaa 
ttaatataca 



tctgtgtact 

agtctgaaga 

atgtgcctgg 

cagatctcag 

aaggaaggga 

agtgtttctg 

attgggatga 

cagtgccgga 

aagcgggtag 

tctaacatca 

ccagcccagg 

accaagatcc 

caggcaccac ' 

tataggaggc 

ccaggagtcc 

cagccccttc 

ttcccctgaa 

tggaatttta 

ccctggccct 

tggtactaat 

^ggtagttaag 

ctacaagcta 

atgggagaaa 

gaaagtgctc 



SEQ ID NO. 13 

Hk8 amino acid 



30 



35 



MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQ 

PWQAALFQGQQLLCGGVLVGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQE I PVVQ 
S I PHPCYN S S DVEDHNHDLMLLQLRDQAS LGSKVKPI SLADHCTQPGQKCT VS GWGT V 
TSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGMVCAGSSKGADTCQGDSGGPLVC 
DGALQGITSWGSDPCGRSDKPGVYTNICRYLDWIKKIIGSKG 



SEQ ID NO. 14 
KLK8 nucleic -acid 



40 



CDS 35. .817 



45 



50 



55 



gtgaccccgc 
caagacgtgg 
ggaggacaag 
cttgttccag 
tacagctgcc 
gaataaagat 
caacagcagc 
ggcatccctg 
ccagaagtgc 
cactctcaac 
ggggcagatc 
gggcgattct 
ctcagacccc 
ggactggatc 
ccttaataaa 



ccctggattc 
atgttcctgc 
gtgctggggg 
ggccagcaac 
cactgtaaaa 
ggcccagagc 
gatgtggagg 
gggtccaaag 
accgtctcag 
tgtgcagaag 
acagatggca 
ggaggccccc 
tgtgggaggt 
aagaagatca 
ctcacaactc 



tggaagacct 
tcttgctggg 
gtcatgagtg 
tactctgtgg 
aaccgaaata 
aagaaatacc 
accacaacca 
tgaagcccat 
gctggggcac 
taaaaatctt 
tggtctgtgc 
tggtgtgtga 
ccgacaaacc 
taggcagcaa 
tctggttc 



caccatggga 
gggagcctgg 
ccaaccccat 
cggtgtcctt 
cacagtacgc 
tgtggttcag 
tgatctgatg 
cagcctggca 
tgtcaccagt 
tccccagaag 
aggcagcagc 
tggtgcactc 
tggcgtctat 
gggctgattc 



cgcccccgac 
gcaggacact 
tcgcagcctt 
gtaggtggca 
ctgggagacc 
tccatcccac 
cttcttcaac 
gatcattgca 
ccccgagaga 
aagtgtgagg 
aaaggggctg 
cagggcatca 
accaacatct 
taggataagc 



ctcgtgcggc 
ccagggcaca 
ggcaggcggc 
actgggtcct 
acagcctaca 
acccctgcta 
tgcgtgacca 
cccagcctgg 
attttcctga 
atgcttaccc 
acacgtgcca 
catcctgggg 
gccgctacct 
actagatctc 
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SEQ ID NO. 15 

KLK3 nucleic acid 



5 CDS join (<1. .39, 418.. 712, 878 . .>946) 
Exon <1. .39 
Exon 418 . .712 
Exon 878 . . 946 

10 tcttcggttc ccggttactg gcagcagccc cctcctccca caaaagatca ggttccaagc 

ttctcctttt aaaagtactt agaattcagc ccccagctct ctcctccctc acacccagga 

atccaggccc ctagcccctc ctccctcaga cccaggagtc ctggccccta gcagccccct 

cctccctcag acccaggagt ctgggccccc agcccctcct cggtcagacc taaatcccag 

gtcccagtcc ctcctccctt agatttagga gtccaggccc ccagcctctc ctccctcaga 

15 cccaggaatc caggccccca gcctcctccc ctctcagaac taaaatcttg gcccccagcc 

ctttatgttt cagatcgtag agtctcagca ccgagtccct cctctcccta gcctcaggag 

tctgagattc cagcccctcc tccctcaaga tttcacgttc aatcccctcc gccccttctc 

actcacaccc agtgttccag ttcccagaag ctccccaggc tctagtgcag gaggagaagg 

aggaggagca ggaggtggag attcccagtt aaaaggctcc agaatcgtgt accaggcaga 

20 gaactgaagt actggggcct cctccactgg gtccgaatca gtaggtgacc ccgcccctgg 

attctggaag gtgaggtgca gaggtactca gatacagaca tcaggccccg gaccctcctt 

ctccagattc caggacccca gcctcagatg cccttctctg tcgagatcca gcagtctgga 

ccccggcttc ctcctctccc taatttagga gtcccagctc ccagctccct gtcccctcag 

acccagacat cgaggactcc cccctccctt ggaatgtagg aatccagtcc cccagcctcc 

25 tccttcctcc agagaagccc agaacagccc cagatactct cggctgcctc cccagtgccc 

aaatccagaa ctgggagctc aggctcctcc ttcctgttta ccggccccgc cctctccatt 

tcccagacct caccatggga cgcccccgac ctcgtgcggc caagacgtgg atgttcctgc 

tcttgctggg gggagcctgg gcaggtgagg agggttgcgg aggcctccgg aggggaggga 

tctgaaggca gcagtggcgc tggggagtct gtgggaatgc cgcgggggtt atgtgggtgc 

30 gtgtgcacgg atgtgaagag tgcgatacgg tgcaggagcc tctgtgggct ttcctcaggg 

tggacagagg caagaaacag gtagcagcag gtaggagtag gttccgtgat gctgtaaatt 

gtctgaatag ctacagcctt tgggggctgc ttgcttgggg gcatagattc acctgggagt 

actcggggcc tgtagactca tgtggaagca tgtgggggca ttcttgggtg tgtgactctt 

gtatgatgac acatggactg aaatgagtgt ccccgtgtgg cagcgtgtgg aagcctggac 

35 ctcctcacta agttgtatgc ggagaacttg ccgtgtgtcc atttgaaccc acagtggcct 

tcccagccct cgcactgccc cagagggtgg cgatccaacc ctctccctcc tgctgcagga 

cactccaggg cacaggagga caaggtgctg gggggtcatg agtgccaacc ccattcgcag 

ccttggcagg cggccttgtt ccagggccag caactactct gtggcggtgt ccttgtaggt 

ggcaactggg tccttacagc tgcccactgt aaaaaaccgt gagtggatga tgggggcaga 

40 ggtcagctgg ggcttaagga aagagggggc tggggtttcg actcaggaag gagagagctg 

aggactggac tcctgggtct gaaggaggag ggggctgggg gcaatacccc tgcctgggtc 

ccaaactatc cccaccatta caggaaatac acagtacgcc tgggagacca cagcctacag 

aataaagatg gcccagagca agaaatacct gtggttcagt ccatcccaca cccctgctac 

aacagcagcg atgtggagga ccacaaccat gatctgatgc ttcttcaact gcgtgaccag 

45 gcatccctgg ggtccaaagt gaagcccatc agcctggcag atcattgcac ccagcctggc 

cagaagtgca ccgtctcagg ctggggcact gtcaccagtc cccgaggtag tgggcttgtc 

cactaatggg agggagagga ggagctggtt ggcccagtgg aacccaagct attggcaaag 

cttggtcccc cagagggaga caaagaaggg aaagtgatca tgatgttgag attcacaagc 

aggagtcata tgagaagcct cgaagatctg actactaaca agagtggtga gagaaagaac 

50 caactagcag attgttaagc agggccagaa aagcccatcc tgtatggcgg agagcactac 

cctatgggac ctttggttgt ataggatttt atataatctt aacacacttg ggatattttg 

gacttctcag aggcccagga aaacaggctc ctaagcacct tctcccccac ctccctcctt 

tttttttttt tttttttttt ttttgcaggg ggacacagtt tcactcaatc gcccaggctg 

gagtgcagtg gcacaatctc agctcactac aatctctgcc tcctgggttt aagcgattct 

55 cgaacctcag cctcccgagt agctgggatt acaggcaccc gccaccacgc gagctaattt 

ttgtatttgt tgtagagaca gggtttcgcc atgttgacca ggctggtctc gaactcctga 

cctcaagtga tctgcccgcc tcggcctacc aaagtgctgg gattacaagc atgagccacc 

gcccctggcc tttttctcct tcttgaaccc aggaaacctg ggccctggtc acctcttccc 
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tcagacccgg gagtctaggc ccttcttctc ccaggaccca gcagtcctag ctccctcttg 

actctggacc ccaaaatctg gacctccaat gaagctgtcc ctttgggact ccagaatcca 

gagagcccct cgccttcctt cacagtgaaa acattgggac tcacctaaga gagtcaagga 

gcttctccag gaagtggcaa agtcagcatt caggtccctg cctgcctcac tcctgctctg 

5 aatgctttgg atgagacagt ttgcggctgt ggaaacacac gtgctcgcaa atcaagtaga 

tcagttcaaa ctatggctct gccctttctc cctgggcaaa ttcctttcca tctctgagcc 

tcactttcct tatctgcaaa atgggaatca ttaaccaata gattttttag ctacgtgagg 

gttcaggtta catcagtttt cttcattgtg ttctcctagt gctcagaatg gtgcctgcta 

catggtagat gctcaataaa tatttgttga atggatgacc tgatgaataa ataatgaaat 

10 gaatgaatgc atctctcctt caaagcgctg ttgtgaggat taaatgagat tatgagcata 

ttgcttttgg cacgtagaga tgccagatgg aagtttttcc ccctcaaaga ggctttggag 

aagtctattc ctcaaaagag gttaataaaa aagatcaatt ccaccttcaa tcattaattc 

aactcttatt tactgagcac ctagtatgcc tgaggtgctg ttgcaggcgc tggatataca 

gccatgagca aactgtacaa agtccttgtc gttatggagt tgcaagctag gtgggagaaa 

15 tagacaataa acaaatacac ataaaataaa acattaggca aagtgctgta aggaaatatt 

gtggagcgta aaaggatagg gagtaatgga gggatggtat tttttaattc gagtggtcag 

ggaaggcttc caggaggagg tgacacttgg aaggagcagg atttagcaca gattgaggac 

ccggttgtct gagggcagga agactggaaa gagagggtca gaggtatcat aaaggggcac 

ctccaagtaa cccccagccc cttgatttga aaatggctca gggtaagaaa aaacacgtga 

20 gatccaaggg cccctctcta gatggagaaa gcccaatagc aacaagtaca gcttcgttta 

atgtggtgag aagtgatgtc ccctgtgcac agtgtcagaa aattccccca tgcagctgga 

aaactccccc attacatcct ggaaagaaag ggggttagat ccagacaggg atggaggcaa 

agggctgctc tctcagggaa ccttacaacc tcttccccct cagagaattt tcctgacact 

ctcaactgtg cagaagtaaa aatctttccc cagaagaagt gtgaggatgc ttacccgggg 

25 cagatcacag atggcatggt ctgtgcaggc agcagcaaag gggctgacac gtgccaggtg 

agcaatttct gaaatccttc tcctcacaca tccctcattg ccctctcgag gttcaaggct 

tggatggggg tgggggtggt aagggagtga ccccaaagac ttggcacccg gagtgttcac 

ccctatctct acggattggg agccaggttc agagaagcca aactctctct ctgaaagtca 

cactgcatag ggattaggag cattgaattg ctgctgctgt tttctttcga acgcttaact 

30 acaggatggg atgcagagtt gggggctata gagggtgggg tagatgtcca gcaaggagag 

agtctgaggc tagagtttag acgagttgcc tgctctctgg ttcccagcat tacatcgtgg 

aatttgtgcc gactttacca ccggcccggt ggtcagccac ttttctttga gaccgggcta 

gaatcccaag gctggttccc ttctcctgtg attggttcct tgggagacaa tggtgtcttc 

ccagttggct ggagtaaatg gtgccattga cttcagtgct tcctcaatga gctccaatcc 

35 ctttctcttg ggatttgtca tagatataac tctctcttct aactgcaact tcgttgttcc 

tagcactttc ccctggcttt gtccacttcc tggaagcccc accacctttg ccaatgactg 

gtccctaaat acaatgcttt cttccccatt ggccaaaaat ggagtcgttt ccatcagcga 

tactgccatg aaagccagtc tctggattgt tctgtagaga tagtggtctc ttccacaaat 

atttcagcca tggtctcttg gggatatggt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 

40 gtgtgtgtgt gtgtccataa gaatcttgat cctttctcct atgtttggca actgcaatca 

atgatgcctt cactattggc caggaacaga ggaaacttca gctcagtcct ctccaagtaa 

tccctactgt cttctccctg gattggaccc tcgagaactc tttttttttt ttttttgaga 

cagggtctcg ctctgtcccc caagctggaa tgcagtggca caatcttggc tcactgcagc 

ctctgcctcc cagttcaagc aattctccca cctcagcctc ccgagtagct gtgattacag 

45 gtgtgcacca ccacacccag ctagtttttg tatttttagt agagacaggg attcaccatg 

ttggccaggc tggtctggaa cgcctgactt caagtgatct accgcctcgg cctcccaaag 

tgccgggatt gcaggtgtga gccaccaagc ctgtctggga tttcattctt tcccctcttc 

tgtcagtgtt ttgaccacta cccttagaca ccatgtctgt ctgtacatgg aagccccaag 

ccctgtcctg actggtctca ggggacaatg cttttacccc cattggctac aggggaccaa 

50 tcatgccaaa gaactggtaa aacgctggga cagcaggaaa agggacgttg tggacatctc 

agatgcaagg ctgttctcat tctccctgtc tagggcgatt ctggaggccc cctggtgtgt 

gatggtgcac tccagggcat cacatcctgg ggctcagacc cctgtgggag gtccgacaaa 

cctggcgtct ataccaacat ctgccgctac ctggactgga tcaagaagat cataggcagc 

aagggctgat tctaggataa gcactagatc tcccttaata aactcacaac tctctggttc 

55 cttgcctgtc tctgttttgg ccctgtgggg agggctggat ggggatccgg gattgttcct 

gctggcagac taggtgggga tgtgcagaaa ccaagttctc atggtcactt tggccatcac 

cactgcctaa agtgatccct cgttttctgg aagaacttgg gtaagagctt tatttcaggg 

gagaaaatac atacaaaggt cttcaaacat tcagtgggct ggtatgtgaa agacagtttt 

gaaagagttt gtgtttagtt ttcctgagca aagcatttac aagctttgga gataaaattt 
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tcccccttta aaaaataatt 



SEQ ID NO. 16 

5 HklO amino acid 

MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLD 

PEAYGAPCARGSQPWQVSLFNGLSFHCAGVLVDQSWVLTAAHCGNKPLWARVGDDHLL 
LLQGEQLRRTTRSVVHPKYHQGSGPILPRRTDEHDLMLLKLARPVVPGPRVRALQLPY 
10 RCAQPGDQCQVAGWGTTAARRVKYNKGLTCSSITILSPKECEVFYPGVVTNNMICAGL 
DRGQDPCQSDSGGPLVCDETLQGILSWGVYPCGSAQHPAVYTQICKYMSWINKVIRSN 



SEQ ID NO. 17 

15 KLK10 nucleic acid 

Gene 1...1580 
CDS 220...1050 



20 catcctgcca 
tcttggcacc 
ggcgtttcgg 
gacccaggag 
ctctccgccg 

25 ctctgggccg 
tatggctccc 
tcgttccact 
ggaaacaagc 
cagctccgcc 

30 atcctgccaa 
gtgctggggc 
cagtgccagg 
ctgacctgct 
gtggtcacca 

35 gactctggag 
tacccctgtg 
tggatcaata 
gttatgctcc 
tcggctgaac 

40 acatctcccc 
aaatgcagga 
agcctctgag 
gtgactttgg 
aacaatgacg 

45 gtaaatcttc 
aaggttacct 



cccctagcct 
gggacccgga 
gcactgggag 
tgccagcctc 
cctctggcgc 
cagaggcggc 
cgtgcgcgcg 
gcgcgggtgt 
cactgtgggc 
ggaccactcg 
ggcgaacgga 
cccgcgtccg 
ttgctggctg 
ccagcatcac 
acaacatgat 
gccccctggt 
gctctgccca 
aagtcatacg 
tgctgatcca 
tctccccttg 
tctcacctca 
agtggtggca 
agcagttact 
gcaagccaag 
tgcctacctc 
atggtgattg 
gttgtcgtga 



tgctggggac 
gaatccccac 
aagcctgtat 
acccacgcag 
ccgggctctg 
gctgctcccc 
cggctcgcag 
cctggtggac 
tcgagtaggg 
ctctgttgtc 
tgagcacgat 
ggccctgcag 
gggcaccacg 
tatcctgagc 
atgtgctgga 
ctgtgacgag 
gcatccagct 
ctccaactga 
gatgcccaga 
tctgcactgt 
ttcccccacc 
aaggtttatt 
ggggtcaccc 
tgccctctct 
ttagacatgt 
tcatgtaagg 



gtgaaccctc 
ggaagccagt 
tccagggccc 
atcctggcca 
gcgaagctgc 
caaaacgaca 
ccctggcagg 
cagagttggg 
gatgaccacc 
catcccaagt 
ctcatgttgc 
cttccctacc 
gccgcccgga 
cctaaagagt 
ctggaccggg 
accctccaag 
gtctacaccc 
tccagatgct 
ggctccatcg 
tcaaacctct 
tatccccatt 
ccagagaagc 
aacctgactt 
gaacctcagt 
tgtgaggaga 
cttaacacag 



tccccgcgcc 
tccaaaaggg 
ctcccagagc 
tgagagctcc 
tgccgctgct 
cgcgcttgga 
tctcgctctt 
tgctgacggc 
tgctgcttct 
accaccaggg 
tgaagctggc 
gctgtgctca 
gagtgaagta 
gtgaggtctt 
gccaggaccc 
gcatcctctc 
agatctgcaa 
acgctccagc 
tccatcctct 
gccgccctcc 
ctctgcctgt 
caggaagccg 
cctctgccac 
ttcctcatct 
ctatgatata 
tgggtggtga 



tgggaagcct 
atgaaaaggg 
aggaatctgg 
gcacctccac 
gatggcgcaa 
ccccgaagcc 
caacggcctc 
cgcgcactgc 
tcagggagag 
ctcaggcccc 
caggcccgta 
gcccggagac 
caacaagggc 
ctaccctggc 
ttgccagagt 
gtggggtgtt 
atacatgtcc 
tgatccagat 
tcctccccag 
acacctctaa 
actgaagctg 
gtcatcaccc 
tccctgctgt 
gcaaaatggg 
acatgtgtat 
gttctgacta 



50 



55 



SEQ ID NO. 18 

KLK10 nucleic acid 

Gene 1 . . 5574 

mRNA join (48. .12 0, 605. .7 01,24 55. .2635,3589. .3863,4195. .4328, 4793. .5474) 
CDS join (614. .7 01,2455. .2 635,3589. .38 63,4195. .432 8,47 93. .4 945) 
Promoter 1...47 

5'UTR join(48. .120, 605. .613) 
exon 48...120 
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exon 605...701 



ttggggtcaa aagggaaggt cccgccaggg gtccctgggc agaggatacc agcggcagac 

cacaggcagg gcagaggcac gtctgggtcc cctccctcct tcctatcggc gactcccagg 

5 tgaagctacc tgcaccccac ccgggttggg gtggattgcg agaggatggg tgggaacccc 

cgggccacag gcaggagccg gcttagagcc tcggtttctc cactgcggga cgcggaagtc 

cccccgctgt gaggttgaga aagaggctcc cactggctcc gagcctcgga tccccacccc 

gcgcgtgaag gagggggaaa cctcgggcgc ggcgtggctg cagccagggt agctggggcg 

cggagagcgc tccactcggg cacagggagg acgggaagat gccgcgaggg gcgtcattag 

10 ggtaattgtg cccattaccg tgttccagcc cgaatctccc gtctgccagc ccctggggtt 

atcggctgca ggtcaaaagg gtgcttgcgg gctggggtgg cccaggctgg gcggatgagc 

gcggcgaggt gggggtctct aacggagcat ctgttttaac ccgcccctgc acacacccca 

gcagatcctg gccatgagag ctccgcacct ccacctctcc gccgcctctg gcgcccgggc 

tctggcgaag ctgctgccgc tgctgatggc gcaactctgg ggtaaggtgg gggacagggg 

15 gcggggagag gcgccggtgg gaggcacggg cgggagggca atgtgttccc gtcaccaagc 

cccgcgcacc tctcctcccc cgagacccca gcacccaccc agcgctccgg agcacggccc 

gcccccaagt cagctgggcc cttcttctgg ctcggcccct gggtgacccg ccccactcag 

gccctgtccg atttctgcca cccggatcct cgctcttccg tggactcttc ggcgtgttct 

ttctcctcct cttacgccgc cccatcccgg ccccgctcca ttttacacta ccgtgttttg 

20 ttttgtttgt ttgtttgttt gagacggagt ctcgctctgt cgcccaggct ggagtgcagt 

ggcgcgatct cggctcactg caagctccgc ctcccgggtt cacaccattc tcctgcctca 

gcctcccgag tagctgggac tacaggcgcc cgccaccaag cccggctagt tttttgtatt 

tttagtagag atgggatttc accgtggtct cgatctcctg gcctcgtgat ccgcctgcct 

cggcctccca aagtgctggg attacaagcg tgagccaccg ctcccggcca caccacagtg 

25 ttttatcctg agtcttgcct taccgctttt tgccctctcc cctcactttt tttcttcctc 

tttccctttc tctctctttt ttctttttct ttctttcttt tctttctttc gttcttcttt 

tccttccttc ctttccttcc ttcctttcgt ttgtttcttc tttcttgttt tactttctct 

cttatttttt cttctttctt tcttgttttt tttctttctc tctctttttc tttccttctt 

tttttttttt tttttttgag acagggcagt gctctgtctc cgtggctgga gtacagtggc 

30 ccaatcagag ctcactgcag cctcgacctc ctgggctcaa gcgatactca gcctccagag 

tagctggtac cacaggcatg caccaccaca tccggctttt tttttttttt tttttttttt 

tttttttgag acagggtctc actctgtcgc ccagactgga gtgcagtggc ccaatctcgg 

ctcattgcaa cctccacctc ctgggctcaa gcgatcctcc cacctcagcc tcccaggtag 

ctgtgactac aggcgcatgc ctccgcgact acttttttgt tgttgttgtt gtttgtttgt 

35 ttttgtagag actgggtctt gctgtgttgc ccgggctggt cttgaattcc cgagctcaag 

cggtccaacc gcctcggctt cccaaagtgc tgggattaca ggcgtaaacc actgcgcccc 

acccctctcc tggttttcaa tcccgttttg ttattcacac cccttcctct ccccgatccc 

cgagttctat ccccgcaccc ttacctcccc gccgcgttca atccccgccc ctctatcgac 

cagcgacgtt ctagccagct ctccaggcgc gctgcgttca gtccctgccc tccagaccca 

40 ccctattctg tctcattact ccacgctacc ctatcccagc ttccttccac tttcacgcgc 

ttcttctcct cccattcctt cggtgcacgc gaaaccccca atatttccct accaccctcg 

cgttctggct gcgtcccccg tccccgaacg cagtccagtg ccacagccca gctccaaccc 

caaacccaga ccctgccctc cgtgctttga ttccgtcccc tttctttctc ccagccgcag 

aggcggcgct gctcccccaa aacgacacgc gcttggaccc cgaagcctat ggcgccccgt 

45 gcgcgcgcgg ctcgcagccc tggcaggtct cgctcttcaa cggcctctcg ttccactgcg 

cgggtgtcct ggtggaccag agttgggtgc tgacggccgc gcactgcgga aacaagtagg 

aggagatcca tccccgagga cgccacgggg ggctgtggag gcggcctcca gggaggagcg 

ggcagggcgg ggtctctcgg aactcccaca gctggaggtg cggtccccgg tgtccttcca 

gcaggaggag agaccgggct tgcgctgtgg ccgccagggg acggtgtggt ccttttccgc 

50 atttctgggc ccgtcactcc tctcccgctg attctccttg agctctagga ggaggtggtt 

gcctgcaacg gatagaagcc agggtccggt gtggtcagga atttagaact aaaggaaaag 

gttcacttcg tgagtccccg ttgaaggagg aagggttggg tattaccaca gagaaaatgt 

ggagttgggc tgggctcggt ggctcacgcc tgtaatccca gcactttggg aggccaaggc 

gggtggatca cctgaggtca ggagttcaag accagcctgg ccaacatggt gaaaccactt 

55 ctcaactaaa aataaaataa aataaaaaat aaaaaatttt aaaaaaatta aaaaaaaaaa 

aaggctgggt gtggtggcgg gtgcctgtaa tcccagctac tcaggaagga ggctgaggca 

ggagaactgc ttgaacctgg gaggtggagg ttgcagtgag ccgagattgc gccattgcat 

tctagcctgg gtgacaagag cgaaactctg tttcaacaaa gaaaagaaaa agagaaaaga 

aaatgtggaa ggcttaccta ggtgtccagg cccccagccc tcctcaattc ctgcagatcc 
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tcagagctca aacaactgat tcctcctccc catgtccact gaggtcccct tctcccacaa 
ggccctcttc cctcagactc ttcctatctc caggccctgc ttcactgccc acctgctttc 
ccagtccctg tgaagggttt gccttcacat gcctcttcct tcccccaggc cactgtgggc 
tcgagtaggg gatgaccacc tgctgcttct tcagggcgag cagctccgcc ggacgactcg 
5 ctctgttgtc catcccaagt accaccaggg ctcaggcccc atcctgccaa ggcgaacgga 
tgagcacgat ctcatgttgc taaagctggc caggcccgta gtgccggggc cccgcgtccg 
ggccctgcag cttccctacc gctgtgctca gcccggagac cagtgccagg ttgctggctg 
gggcaccacg gccgcccgga gaggcaagag ctggggctct gaggccagaa cctcaggagg 
agggggctga gggcctgaac ccctgggtct gaggaaggat gggctgggga ctggattcct 

10 ggatctgagg gaggacgggc tggggtccta gatgcctggg tctgtgagtc tgaggggagg 
aggggctggg ggcctggact cctgggtcta agtggggagg ggctggggcc aggattcttg 
agtctgaagg aggaggggct ggggcttagg atagaaacgg tcttgtatct ggactcctgg 
ctccccaagg attgggggct ggacccaggg attactggca tattctccct tcagtgaagt 
acaacaaggg cctgacctgc tccagcatca ctatcctgag ccctaaagag tgtgaggtct 

15 tctaccctgg cgtggtcacc aacaacatga tatgtgctgg actggaccgg ggccaggacc 
cttgccaggt agggtctgaa cagggagagt ctctgactcc tgggagggag gacagggagg 
ttatgggaaa agagcagacc ctgtgcccga tcccaaactc cattcccaaa cccatccttg 
accccaactc ttacccagac ctaaccccct cctcatccct atcctcaatc ccatttccat 
cctaacccca ccccattccc atctccaagc ccattttcat cccctcacct tccatgaact 

20 acaatcccaa cccaagtctc actgtgcctt cattctcatc ccccagccca acctcccata 
acctgaagtc cacctccatt cctaccttcc agctcatacc taattccaac cccatcccat 
cctcgtcttt atcccaaccc aaccccttcc ttccccacca ctgccccaga tcccaaagtg 
acagctctca cgttggcaca tttatttgat ctctcctttc tgccaccccc agagtgactc 
tggaggcccc ctggtctgtg acgagaccct ccaaggcatc ctctcgtggg gtgtttaccc 

25 ctgtggctct gcccagcatc cagctgtcta cacccagatc tgcaaataca tgtcctggat 
caataaagtc atacgctcca actgatccag atgctacgct ccagctgatc cagatgttat 
gctcctgctg atccagatgc ccagaggctc catcgtccat cctcttcctc cccagtcggc 
tgaactctcc ccttgtctgc actgttcaaa cctctgccgc cctccacacc tctaaacatc 
tcccctctca cctcattccc ccacctatcc ccattctctg cctgtactga agctgaaatg 

30 caggaagtgg tggcaaaggt ttattccaga gaagccagga agccggtcat cacccagcct 
ctgagagcag ttactggggt cacccaacct gacttcctct gccactcccc gctgtgtgac 
tttgggcaag ccaagtgccc tctctgaacc tcagtttcct catctgcaaa atgggaacaa 
tgacgtgcct acctcttaga catgttgtga ggagactatg atataacatg tgtatgtaaa 
tcttcatgtg attgtcatgt aaggcttaac acagtgggtg gtgagttctg actaaaggtt 

35 acctgttgtc gtgatctgac cacgtcccgg tgaaagcgtg tgtccaggga agaagtgcac 
agggtagccc ccaatcccaa ccttccatcc ccaaccctta gggatgatgg aaga 



SEQ ID NO. 19 

40 Hkl 1 amino acid 

MQRLRWLRDWKS S GRGLT AAKE P GARS S PLQAMRI LQL I LL ALA 

TGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHCLKPRYIV 
HLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITWAVRP 
45 LTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTM 

VCASVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDW1Q ETMKNN 

SEQ ID NO. 20 

Hkll amino acid 

50 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTR 

LLCGATLIAPRWLLTAAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSL 
PNKDHRNDIMLVKMASPVSITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHT 
LRCANITIIEHQKCENAYPGNITDTMVCASVQEGGKDSCQGDSGGPLVCNQSLQGIIS 
55 WGQDPCAITRKPGVYTKVCKYVDWIQETMKNN 
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SEQ ID NO. 21 
KLKll nucleic acid 



10 



15 



20 



aggaatctgc 
atcgggcaga 
catgaggatt 
caggatcatc 
cgagaagacg 
agcccactgc 
ggagggctgt 
cagcctcccc 
ctccatcacc 
cagctgcctc 
cttgcgatgc 
caacatcaca 
gggtgactcc 
ccaggatccg 
ggactggatc 
accctccatt 
caagaccctc 
aatcaacctg 
gactctggga 
tcctggccat 



gctcgggttc 
ggtctcacag 
ctgcagttaa 
aaggggttcg 
cggctactct 
ctcaagcccc 
gagcagaccc 
aacaaagacc 
tgggctgtgc 
atttccggct 
gccaacatca 
gacaccatgg 
gggggccctc 
tgtgcgatca 
caggagacga 
tccacttggt 
tgcgaacatt 
gggttcgaaa 
atgacaacac 
atatcaaggt 



cgcagatgca 
cagccaagga 
tcctgcttgc 
agtgcaagcc 

gtggggcgac 

gctacatagt 
ggacagccac 
accgcaatga 
gacccctcac 
ggggcagcac 
ccatcattga 
tgtgtgccag 
tggtctgtaa 
cccgaaagcc 
tgaagaacaa 
gtttggttcc 
ctttgggcct 
tcagtgagac 
ctggtttgtt 
ttcaataaat 



gaggttgagg 
acctggggcc 
tctggcaaca 
tcactcccag 
gctcatcgcc 
tcacctgggg 
tgagtccttc 
catcatgctg 
cctctcctca 
gtccagcccc 
gcaccagaag 
cgtgcaggaa 
ccagtctctt 
tggtgtctac 
ttagactgga 
tgttcactct 
cctggactac 
ctggattcaa 
ctctgttgta 
atttgctaaa 



tggctgcggg 
cgctcctccc 
gggcttgtag 
ccctggcagg 
cccagatggc 
cagcacaacc 
ccccaccccg 
gtgaagatgg 
cgctgtgtca 
cagttacgcc 
tgtgagaacg 
gggggcaagg 
caaggcatta 
acgaaagtct 
cccacccacc 
gttaataaga 
aggagatgct 
attctgcctt 
tccccagccc 
tgagtg 



actggaagtc 
ccctccaggc 
ggggagagac 
cagccctgtt 
tcctgacagc 
tccagaagga 
gcttcaacaa 
catcgccagt 
ctgctggcac 
tgcctcacac 
cctaccccgg 
actcctgcca 
tctcctgggg 
gcaaatatgt 
acagcccatc 
aaccctaagc 
gtcacttaat 
gaaatattgt 
caaagacagc 



61 

121 

181 

241 

301 

361 

421 

481 

541 

601 

661 

721 

781 

841 

901 

961 

1021 

1081 

1141 



25 



SEQ ID NO. 22 

KLKll nucleic acid 



30 



gene 2313. .7622 

mRNA join(2313. .2398,4189. .4263,5061. .5217,5545. .5810, 
6627. .6763,7158. .7622) 



CDS join (4224. .42 63,5061. .5217,5545. .5810,6 627. .67 63, 7158. .7310) 



35 tgataatagt gttctctctc ctcattggtc agggccccag ccattgtcct tgagagaatg 61 

ctcgactctt tatgttgtct tgacagcctc ccctgagatt ggtcattaat gactgtgctc 121 

tctctcctca ttggtcaggg ccccagccat tgtccttgag agaacctctg tcctttatgg 181 

agttccaccc ttcttccctg ggattggccc ctagagacag tggttcttct cttttggtta 241 

gccattgcca ttgtcctccg ggaaagtgat tatactcttt tgtctaatga ccagacttgg 3 01 

40 agccctcccc aaggcccagg actgggttga agggttgggg aggaaaacag aaataagatg 3 61 

tctcccttgt tcagacagta cttctcttcc cttccagggt gattctgggg gccccctggt 421 

gtgtggggga gtccttcaag gtctggtgtc ctgggggtct gtggggccct gtggacaaga 481 

tggcatccct ggagtctaca cctatatttg caagtatgtg gactggatcc ggatgatcat 541 

gaggaacaac tgacctgttt cctccacctc cacccccacc ccttaacttg ggtacccctc 601 

45 tggccctcag agcaccaata tctcctccat cacttcccct agctccactc ttgttggcct 661 

gggaacttct tggaacttta actcctgcca gcccttctaa gacccacgag cggggtgaga 721 

gaagtgtgca atagtctgga ataaatataa atgaaggagg ggccatgtct gtccatttga 7 81 

agtcctcatg ctggttgaga ctggaagaag gactcagcag tttccctatc tcataggagt 8 41 

agaaacagag ctcaaataag gccaggcaca gtggctcaca cctgtaatcc catcactttg 901 

50 ggaagctgag gcaggtggat cacctgaggt caggaactcg ggaccagcct ggtcaacata 961 

gtgaaacccc aactctacta aaaatgcaaa aattagccag gcatggtggc gcatgcctgt 1021 

aatcccagct actcaggagg ctgagacagg agaatagcat gaacccgtga ggcagaggct 1081 

gcagcgagcc gagattgaac cattacactc cagcctgggc gacagagcga gactccatct 1141 

caaaaacaaa caaacaaaaa acccagtgct caaataggat gagggtcttc cctgagtagt 1201 

55 tactcagaaa tggagtagaa aaagttactt ttaataatat aggccgggtg cagtggccca 12 61 

cgcctgtaat cccagcactt tgggaggccg aggtgggagg atggcttgag ctcagatttc 1321 

gagatcagcc tggcaacaca gtgaaatctt gtcactacaa aaacacaaaa aattagctgg 1381 
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gtgtggtggt gcgtgcctgt agtcccagct acttgggaag ctgaggtggg aggatcaccc 1441 
gagccgggga ggtggaggct gcaaagagcc gagatcatgc cactgcactc cagcctgggc 1501 
aataaagtga gaccttgtct caaaaacaaa aacccagcaa tataaataag acacatgttt 1561 
cttcatctgg cataatagaa atagtgccca gagcttataa gcttttcaag agtccacaaa 1621 
5 agacccgaaa aagaaaaaga aaattgttag ctccaaaata ccagatgaaa gctgcaaagt 1681 
caacatttat gaccatttaa tccaatgtcc ataaaacgta gcattctttc cactagccaa 1741 
ctgcagttta ctttcttgta atgaagcata cattgtatct ttaatgtggg acgtggcttt 1801 
gttctaataa gacgaagggt ggagtgcagg cttggaaagc aggagagctc agcctacgtc 18 61 
tttaatcctc ctgcccaccc cttggattct gtctccactg ggactcaaga ggtgaggaga 1921 

10 gaccatctcc ccaaatgcac tgaagggaaa ctggaggagg gagggagtga ggggtgatca 1981 
taccagcgga ggcacatttg ctgagccccc ccgcagtctg ctctttccaa gtggaccctc 2 041 
ctggaagcct gatcccaacc tcccctgcaa gcaggtctgt cacccccatc tctcagatga 2101 
agaaactgag ccttgcaggg gtggagtccc ttgtccccac gtcataaggg tagtcatagt 2161 
agtaggaaga ggaagcacct aggtttgagg ccagggctgg ctgctgtcag aacctaggcc 2221 

15 ctcccctgcc ttgctccaca cctggtcagg ggagagaggg gaggaaagcc aagggaaggg 2281 
acctaactga aaacaaacaa gctgggagaa gcaggaatct gcgctcgggt tccgcagatg 2341 
cagaggttga ggtggctgcg ggactggaag tcatcgggca gaggtctcac agcagccagt 24 01 
aagtgaacag ctggactcgg gctgcctggg cggcagggag aagcgggcag gggaagggtc 2 4 61 
agcagaggag cgaggcccca gaggagccct ggggtggagc acagccaagg gctctgttcc 2521 

20 ctttcctgga ctcggcttcc acaggccctg acctgcctcc cccaccctcc ggtcctgccc 2581 
ctgtgcctgg cagcagcccc acctgtgtga catcccagca caccccccct ctccttgcaa 2 641 
aggagaaggg agcggcctag gggaggccag gggcccacct gggctggggc tgtggagagg 2 7 01 
gagtggctgg gacgggagga aaaagagaga cggagattag atggaagaag agggatttca 2 7 61 
agacaaattg ccagagatgc agtcagagag actgactgag agacacaaag atagaaggaa 2821 

25 ttagagaaag ggccacacag agccagacag agagagaaga gtggagatgg agacagggac 2 8 81 
gaggacagag aaaggcagac agacacatag ggacagaaag agaaaaatca cacaaagtca 2 941 
gaattactga atgacaggga atgacacata gaacgagaca cagattcaga gactcagggc 3001 
agggaaagga aggctgcaga cagacagaca gacagaggga ggctgagaca cagggagaag 3061 
aggggcttgg agaggtggca caggcaggca gccagtgcct cagaggcctc cggggagggc 3121 

30 cctcacacac accccgcccc ggggcattaa ggcagggctt ggaggccagt catcctgggc 3181 
ccgcccaggg ccgcccccct gccagcccgc ctgcctggtg cctggcacct ggcgctccaa 3241 
cccagcctac ctgctgtagc tgccgccact gccgtctccg ccgccactgg gcccccagag 3301 
ccccagcccc agagcctgtg agtccaggag gaaagggaag ctgcccctcc ccgtccaggt 3361 
gtcagccctc cccaaggaca cctgtcccac tcgggcaccc atttctccct ctgctctgtc 3421 

35 cttctctgct tgggtggggg ttcctggcct ctctctacac ctctcacctc cgatggctgt 3481 
ccgcagcctc agttacctct aatctccatg gcttcagctg ctgagctggc cctctgctcc 3541 
cacccccgct ggccagggca gcggagggca ctggccctcc cctcgaccag ccccgcccag 3 601 
ctttgcttgg ctgtccttca aaagggcagg ggtttggcgg acagggcttc agcaagccgg 3 661 
gtgatggggg tcccagacat tgtctggggc tgagccccct actcccctcc agcagacctc 3721 

40 aaaggctcca tatcgctctg ctgcgaagac aatgaaaaag gggtggctac ggaacggtgt 37 81 
ctggttcccc ttgtccttcc accccaagct gctggggcct ggccagctct caaggcaaga 38 41 
aggaaaacat cctctgacat gtgccgggga ggtcccatgg ctgacttgaa cagggccgaa 3901 
ccatggcttg acagctcaaa gcccctccca acgacttcca catggttctt ggtatctcgg 3961 
aagcttctag ctgtgaccag gccctctcca aggccaccct agacacctaa gatatatttt 4021 

45 aagtgtttgg agatctgagt gctgtgagaa acaggggatt tccccaacct tgtttctccc 4 081 
aagtggggag cgggagcagg tgagggagag aggagagggc atgagccagc ccccccctcc 4141 
cgatttcccc gtaaagtgat gcggccccat gtccctcctt gttcccagag gaacctgggg 4201 
cccgctcctc ccccctccag gccatgagga ttctgcagtt aatcctgctt gctctggcaa 4261 
caggtacgca ggggatgggg gcagggcagg atcctccctc ttgaatctct gggatcccct 4321 

50 aaccctctgt gtctggacag tgacagggct gattccaaat tacagaacaa cccataaggc 4381 
acctgaactg gagcagtggt catgagggcc tggatgccct tctagataat ccctttaaat 4441 
gccaaaggag gagaggtcaa gggggtcgta aagggtcccg tggaggggct gaggaagctg 4501 
gagttggggg agcagtcact caaagcgccc aggacagggg ctactgacca accagtatgg 4561 
aagtatttcc tttttttttt ttcccagaga caaagtcttg ctctattgtc caggctggag 4 621 

55 tgccgtggtg ccaacacggc tcactgcagt cttgacttcc cgggcttaag tgatccttaa 4 681 
gccatctcag cttccccggt agctgggacc acaggcacct gccaccaagc caggctaatt 4741 
gtttaattgt ttgtagagat gggggaggag gtctcactat gtttgcctag gctgatctag 4 801 
aactcctggg ctcaagtaat cctcccacct tagcctctca aagtgctggg attacaggca 4861 
tgagccactg catttgacct tatggaagta ttttcatcct ttaatacccg accccagcat 4921 
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ccagggcaac ccagagggac accagaccag ggcccagacc acccactctc tttctctcct 4981 
ccccaccccc atttctggga gtcctcctgg tctaccacct ctccttcctg agccccttct 5041 
tttgctctca ccccctccag ggcttgtagg gggagagacc aggatcatca aggggttcga 5101 

gtgcaagcct cactcccagc cctggcaggc agccctgttc gagaagacgc ggctactctg 5161 

5 tggggcgacg ctcatcgccc ccagatggct cctgacagca gcccactgcc tcaagccgtg 5221 

ggtgcggggg ctggggcggt gccggggtgg ggggctggga atggggagat ggatggagag 52 81 

aagctcaggg ataggggtgc tggtaagggg attagagatg gggatgggta gtgtcagcaa 5341 

ggttgatggg ctcgagttgg tattgaaggt ggggggatga atggggttgg gatggggcta 5401 

tggctgggaa gggggcttcg gtgggagacg tggaagaggt tggaagcaga gcgatgtttc 5 4 61 

10 ttcatcctca aaggtgtcac tcacctctcc cacccatgtc tcccccgacc tttcctcctc 5521 

caactactgt ctctcccacc tcagccgcta catagttcac ctggggcagc acaacctcca 5581 

gaaggaggag ggctgtgagc agacccggac agccactgag tccttccccc accccggctt 5641 

caacaacagc ctccccaaca aagaccaccg caatgacatc atgctggtga agatggcatc 5701 

gccagtctcc atcacctggg ctgtgcgacc cctcaccctc tcctcacgct gtgtcactgc 5761 

15 tggcaccagc tgcctcattt ccggctgggg cagcacgtcc agcccccagt gtaggagcac 5821 

cagaggggaa cctggcaggg ggtggtgagg agggagtggt caggattgtg gaagggttca 5 8 81 

gggcatcaga gatgcggttc acagtgacga tgtgggataa gttgagagga tgtgtggaaa 5941 

acgtcaggat aggggggtgg ggacaaaagt tggggccttg gagtcagacg gacgggatat 6001 

gcaatcatac atccataacc tcctggttgt aagaccttag gcaagcagct tcacctctct 6061 

20 gaatcttgat tttcttctct ataaaatgag aatgattata cccacctgtc aggattggat 6121 

tagagataat gtatatcaag caactgacat aaatcattta ttggatagca ggctgggcac 6181 

cgtggctcac gcctgtaatc ccagcacttt gggaggccga ggtgggaaga tcacctgagg 6241 

tcaggacttt gataccagcc tggccaacgt ggtgaaatcc catctctact aaaaatgtga 6301 

aaattagttg ggcgtggttg tgtgcgcctg taatcccagc tactcgggag gttgaggcag 6361 

25 gagaatcgct taaacttggg agacggaggt tgcagtgagc caagatcacg ccactgcact 6421 

ccagcctggg caacagagca agactctgtc tcgaaaaaaa aaaaaaaaaa gctggatagc 64 81 

attgctgttg ctattgttac aagaagagag gtgagttggc tgcgtctaag gacagggatt 6541 

cccccagggg cgggatcaca gcaagcactg cattagggga ggtggcaggg ggctcattcc 6601 

cacagcccct cacgctgttt ccacagtacg cctgcctcac accttgcgat gcgccaacat 6661 

30 caccatcatt gagcaccaga agtgtgagaa cgcctacccc ggcaacatca cagacaccat 6721 

ggtgtgtgcc agcgtgcagg aagggggcaa ggactcctgc caggtcagtg tggtctccaa 6781 

ccacagcccc atccccatcc ccagcttcaa tgacatcttt accgacatcc acaatttcat 6841 

ccccaacctc aacccgccga cccctgcaac tcccaatcca tctcttcccc tgttcccgtt 6901 

tctgacctca gcacaaactt cagctccatc cccgtttcca caccatttcc agctccaacc 6961 

35 atccccaaac tcgtttttga gcctaacccc atcctttatc ccacccataa tcccagcttt 7021 

atcgctaaac ctatcacctt tcccagtgcc tacccatcct gtctcggccc cactcctaag 7081 

caccgtcccc acctcctccc tggctaacac catgctcaac gctttctctg accgacattc 7141 

tctctccccg tgcccagggt gactccgggg gccctctggt ctgtaaccag tctcttcaag 7201 

gcattatctc ctggggccag gatccgtgtg cgatcacccg aaagcctggt gtctacacga 72 61 

40 aagtctgcaa atatgtggac tggatccagg agacgatgaa gaacaattag actggaccca 7321 

cccaccacag cccatcaccc tccatttcca cttggtgttt ggttcctgtt cactctgtta 7381 

ataagaaacc ctaagccaag accctctacg aacattcttt gggcctcctg gactacagga 7441 

gatgctgtca cttaataatc aacctggggt tcgaaatcag tgagacctgg attcaaattc 7501 

tgccttgaaa tattgtgact ctgggaatga caacacctgg tttgttctct gttgtatccc 7561 

45 cagccccaaa gacagctcct ggccatatat caaggtttca ataaatattt gctaaatgag 7 621 

tgaatctact gagtgcttac tatgtgctag accctgatcc aatggctttt attttatttt 7 681 

attttttgac agagtctcgc tctgtcaccc aggctggagt acagtggtgc tatctctgct 7741 

cactgcaacc tccacctcct gggttcaagc aattctcctg cctcagcctc ctgaatagct 7 8 01 

gggattacag gtgcctacca ccacatccgg ctaatttttg tattttttag tagagatggg 7 8 61 

50 gcttcaccat gttggccagg ctggtctcga actcctgacc tcagatgatc tgccctcctt 7 921 

ggcctcccaa actcctggga ttacagacgt gagccaccgc gcccgcccgg ctttcattta 7 981 

ttaattaaaa gaaattaaat taattaatct atttaggaga cagtcttgct ctgttgccca 8041 

ggctggagtg cagtaacaat cacagctcac ggcaatctca atttcctggg gtcaagtgat 8101 

tgtcctccct cagcctccag agtagctggg actacaggca catgccacga agcccagcta 8161 

55 atttttgtat ttttcgtaga gacagaggtc tcagtatgtt gccccggcta gtctcaaact 8221 

cctgggctca agcagtctgt cctcctcagc ctccaaaagt ggtgagatta caggcatgag 8281 

tcgctgtgcc tggcctccaa gcactttcaa atgtatcaac ttaatcctca caaaaccctg 8341 

tgaggtcggt actgttttca tacctatttt atagttgaag aaacagacac agagaagcaa 8401 

agtcacttgc tcacagtcac gtggctagga gagcaaggat ctgaagcaag gcgatctctt 8 4 61 
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aattaccaag 
catgcaaaag 
accaaaggag 
tctgctggcc 
5 agccagaaat 
tgactggtgg 
agcagcactg 
ggatcttttt 
cagtggcgtg 
10 ctcagcctcc 
gtctttttag 



tgatgttcct 
tataacacag 
cttattgttt 
aatatggtag 
gtaactcctc 
tggctggatt 
agttagatgg 
tttttttttt 
atctcggctc 
ctagtagctg 
tagagatggg 



ggagtaaggc 
taagtaaaga 
ttctccaact 
ccactaaccg 
tgtctcactt 
agctagtgca 
cctctgaaga 
ctttgcgaca 
actgcaactt 
ggactacagg 
gtttcactat 



tctgtttgtt 
agtcagttag 
tccatgatag 
cagctggctc 
gttatatctc 
tataaaacat 
ggatgtccca 
gagtcttgct 
ctgcctccca 
cctgtgccaa 
gttggccagg 



tcctttcctg 
cctgcacata 
gtaattagat 
ttccaattaa 
caaggctgga 
cactgcagaa 
cggagagaat 
ctgtcaccca 
ggttcaagca 
catccccagc 
ctggtctcga 



taaaatgctg 
ctaagaccta 
agtggagacc 
aattacataa 
tagccacatg 
agttcagctg 
ccagaactca 
ggctggagtg 
attctcctgc 
taatttttgt 
actcctgacc 
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8821 
8881 
8941 
9001 
9061 



